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Abstract —We consider the problem of distributed scheduling 
in wireless networks where heterogeneously delayed information 
about queue lengths and channel states of all links are available 
at all the transmitters. In an earlier work (by Reddy et al. in 
Queueing Systems, 2012), a throughput optimal scheduling policy 
(which we refer to henceforth as the R policy) for this setting was 
proposed. We study the R policy, and examine its two drawbacks 
- (i) its huge computational complexity, and (ii) its non-optimal 
average per-packet queueing delay. We show that the R policy 
unnecessarily constrains Itself to work with information that is 
more delayed than that afforded by the system. We propose a new 
policy that fully exploits the commonly available Information, 
thereby greatly improving upon the computational complexity 
and the delay performance of the R policy. We show that our 
policy is throughput optimal. Our main contribution in this work 
is the design of two fast and near-throughput-optimal policies for 
this setting, whose explicit throughput and runtime performances 
we characterize analytically. While the R policy takes a few 
milliseconds to several tens of seconds to compute the schedule 
once (for varying number of links in the network), the running 
times of the proposed near-throughput-optimal algorithms range 
from a few microseconds to only a few hundred microseconds, 
and are thus suitable for practical implementation in networks 
with heterogeneously delayed information. 

Index Terms —Distributed Scheduling, Heterogeneous Delays, 
Network State Information, Throughput Optimality 


I. Introduction 


A central problem in any wireless network is one of 
scheduling the different users in the network with the ob¬ 
jective of optimizing some desired metric - for example, max¬ 
imizing the system throughput - in the presence of challenges 
that are unique to the wireless medium - namely, channel 
fading and interference due to transmissions from other users 
in the network. This problem has been studied extensively 
in the literature. A highly influential and often cited work in 
this area is the work by Tassiulas and Ephremides Q, who 
proposed the Back-Pressure scheduling algorithm (a version 
of the Max-Weight algorithm Q, 0), which is a centralized 
algorithm that schedules the links in the network based on 
global knowledge of the instantaneous queue lengths at all 
the links. Even though this algorithm is provably throughput 
optimal (see Sec. II-E| l, it is a centralized algorithm that 
requires solving a global optimization problem in each time 
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slot, and it also requires knowledge of instantaneous queue 
lengths at all links in the network to determine the schedule 

Q ©■ 

The Max-Weight algorithm, being a centralized policy, 
involves the computationally costly task of hnding a maximal 
independent (i.e., non-interfering) set of links that can be ac¬ 
tivated simultaneously and whose summation of link-weights 
is maximum 0 - To circumvent this limitation, researchers 
have considered two broad approaches 0 - namely, design 
of random-access algorithms in which access probabilities are 
dependent on queue sizes 0 or on arrival rates g-ig, and 
design of distributed implementations of the Max-Weight algo¬ 
rithm 04), ng. Some of these approaches require knowledge 
of instantaneous queue lengths and/or instantaneous channel 
states to attain their objective. 

Even though it may be reasonable to assume that each node 
has knowledge of instantaneous queue lengths and channel 
states (at all times) for its own links (links emanating from 
itself), it is less pragmatic to assume that any node possesses 
instantaneous information about any link in the network other 
than its own links (at any time instant). This could be because, 
for example, these quantities vary quickly with time (e.g., 
fast fading), or because the propagation delay of the feedback 
channel is large GD- In |17) , the authors consider networks 
where each node possesses knowledge of instantaneous queue 
lengths and channel states for its own links, but only has 
these information from other links in the network with some 
globally hxed delay (commonly referred to as homogeneous 
delay). The assumption of homogeneous delays, however, is 
not satished in many networks where there is often a mismatch 
in the delays with which each node can acquire information 
about queue lengths and channel states of other links in the 
network 0- An example of such a case is a network where 
the “quality” of information that a node possesses, of queue 
lengths and channel states of links other than its own direct 
links, is a decreasing function of their “distances” from this 
node IjT]. Non-homogeneous delays are commonly referred to 
as heterogeneous delays. 

One serious issue with heterogeneous delays is that the 
nodes could potentially have different views of the state of the 
network 0- In 0, the authors consider distributed scheduling 
in a wireless network where information about the queue 
lengths and channel states of links in the network, available at 
the nodes, are heterogeneously delayed. They characterize the 
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system throughput region for this setting, propose a scheduling 
policy, and show that their policy is throughput optimal. We 
study the limitations of the policy proposed in ||T|. 

As in ||Tj, we consider the problem of distributed scheduling 
in wireless networks where delayed queue length and delayed 
channel state information of each wireless link in the network, 
are available at all the transmitters in the network, but with 
possibly different delays. We refer to these as heterogeneously 
delayed queue state information (QSI) and heterogeneously 
delayed channel state information (CSI) respectively, and 
collectively refer to them as heterogeneously delayed network 
state information (NSI). We refer the reader to Sec. [IkD] for 
detailed information about the structure of heterogeneously 
delayed NSI that we have considered in this work. Our 
contributions are summarized below. 

A. Our Contributions 

Our primary contribution in this work is the design, analysis 
and performance evaluation of two fast and near-throughput- 
optimal scheduling policies for the case of networks with het¬ 
erogeneously delayed NSI. Starting with the policy proposed 
in Q (which we refer to henceforth as the R policy), we arrive 
at our fast and near-throughput-optimal policies through the 
following steps of identifying issues with the R policy and 
making progressive refinements to address these issues; 

1) First, we study the limitations of the R policy. The R policy 
is formulated as a functional optimization problem that 
searches for optimal threshold functions (see Sec. that 
maximize the queue-length weighted aggregate expected 
throughput of the system. We study the computational 
complexity of the R policy and note that the R policy 
is computationally very costly - (i) it needs to evaluate a 
number of functions in its optimization domain that grows 
as a double exponential in the number of links in the 
network, and grows as an exponential in the number of 
levels into which the channel states on the wireless links 
are quantized, and (ii) it needs to consider, in computing the 
expected rate, a number of sample paths that is exponential 
in the number of links in the network and in the maximum 
of the heterogeneous delay values. 

2) We show that the delay performance of the R policy is non- 
optimal, and that it can be improved upon significantly. 

3) Next, we show that the structure of heterogeneously de¬ 
layed NSI as defined in the system model in ||Tj affords 
each node access to NSI that is less delayed than that 
used in |[T], and yet commonly available at all nodes 
in the network. We propose two modifications to the R 
policy to make use of this less delayed NSI (we call the 
resulting policy the H policy). We show that the H policy 
hugely improves upon the computational complexity of 
the R policy, and numerically show that the H policy 
also improves upon the delay performance of the R policy 
substantially. We establish that, like the R policy, the H 
policy too is throughput optimal. 

4) Finally, we show that despite the huge leap that the H 
policy takes in reducing the computational complexity in 
comparison to that of the R policy, the computational com¬ 
plexity of the H policy still remains impracticably large. 


making the case for low-complexity scheduling policies. 
Taking design cues from the H policy, we propose two low- 
complexity near-throughput-optimal algorithms that are 
several orders of magnitude faster than the R and H 
policies. We obtain explicit analytical expressions for the 
expected saturated system throughputs of these policies, 
evaluate their performance and show that these policies 
closely approximate the optimal throughputs of the R 
and H policies. We also show that these policies possess 
desirable queueing delay characteristics. 

II. System Model 

Our system model is precisely the same as that used in 
Q. We use a slotted-time system and restrict our attention 
to single-hop transmissions. We borrow heavily in definitions, 
notations and nomenclature from Q to keep our presentation 
easily cross-referenced and compared with ||T]. 

A. Network Model 

Our model of the wireless network has L transmitter- 
receiver pairs (or links); the set of links is denoted C. We 
abstract the channel condition on each of the wireless links 
by the link’s capacity. We model the time-varying capacity of 
each link I as a separate discrete-time Markov chain (DTMC) 
denoted {(7;[f]} on the same state space C = {ci, C 2 ,. .. cm}, 
where ci < C 2 < ... < cm are n on-n egative integers, and 
with the same transition probabilitie s *p| We assume that the 
channel conditions on the wireless links are all independent 
of each other, but identically distributed, with transition prob¬ 
abilities pij := Pr[Ci[t -f 1] = Cj I Ci[t] = Ci]. We assume 
that these DTMCs are irreducible and aperiodic, and therefore 
have a stationary distribution, with the stationary probability 
of being in state Cj,j G {1, 2,..., M} denoted 7r(cj). 


B. Interference Model 

For each link I, we let Ii denote the set of links in the 
network that interfere wit h tra nsmissions on link Z; thus, // U 
{Z} is an interference We say that a collision occurs 

with a transmission scheduled on link Z if, in the same time 
slot, a transmission is scheduled on at least one link I' G f. 
When a link Z successfully transmits (i.e. when the packets 
transmitted on link Z do not encounter a collision), a number 
of packets equal to Ci[t] in the case of saturated queues, and 
equal to min(C'i[f], Qi[t]) in the case of non-saturated queues, 
are successfully received by the receiver on the other side of 
the link. However, in the case of a collision, yiCilt] packets 
in the case of saturated queues, and mva{'yiCi\f\,Qi\t]) in the 


'We use C to denote both the set and its cardinality. 

^We remai'k that the above channel model is assumed for making notations 
simpler and to enhance clarity. Our results hold even for the case of networks 
where each link is modeled as a separate DTMC (with different state spaces 
and/or different transition probabilities). 

can capture arbitrary interference constraints. 

'^An interference set is a set of wireless links such that if there is 
transmission on more than one link in the set in the same time slot, then 
these transmissions interfere with one another, possibly resulting in only a 
part of the transmission being received successfully at (any of) the intended 
receiver(s). 
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TABLE I: An instance of (small) delay values for a wireless network 
with three links 



At TX At 

At TX fit 

At TX Ct 

Delay in obtaining NSI of link li 

0 

1 

3 

Delay in obtaining NSI of link I2 

2 

0 

4 

Delay in obtaining NSI of link Z3 

1 

2 

0 


i 3 , C are transmitter nodes of links /i, /2, and respectively. 


case of non-saturated queues, where 7 ; G [ 0 , 1 ], are received 
at the intended receivers 0 

C. Traffic Model and Queue Dynamics 

As noted earlier, we use a slotted-time model. Packets 
arriving at a transmitter, depending on their intended desti¬ 
nation, are assigned a wireless link. Prior to transmission on 
this link, these packets are buffered in the queue associated 
with this link. We denote by Qi[t] the length of the queue 
associated with link I, at time t. We model the packet arrivals 
into the queue associated with link I, by an arrival process 
denoted Ai\t], that describes the number of packets that arrive 
into the queue, at time t. For every link I, we assume that 
Ai\f\ is an integer-valued process, independent across time 
slots t, with 0 < Ai[t] < Ajnax < 00 almost surely, with 
A; := < 00 . The queue lengths are governed by the 

update equation: 

Qi[t -f 1] = {Qi[t] + Ai\t\ — ( 1 ) 

where Si[t] := Ci[t] is the maximum possible service rate of 
link I at time t, and := max( 0 ,a:). 


D. Information Model and Structure of Heterogeneously De¬ 
layed NSI 

We use precisely the same structure for heterogeneously 
delayed NSI that each node has access to, as that used in ||T]. 
In our model, at time t, the transmitter node of each link I 
has information of the current queue length and the current 
channel state of link I, but only has delayed queue length 
and delayed channel state information of other links in the 
network, where these delays are possibly heterogeneous but 
time-invariant. As an example, consider the delay values in 
Table H] for a wireless network with three links li, I 2 , I 3 with 
A, B, C as the transmitter nodes of these links respectively 
such that they form a network with complete interference^ 
From the first column of this table, transmitter A has NSI of 
link li (a link emanating from transmitter A) with a delay of 0 
time slots, NSI of links I 2 and (3 with delays of 2 and 1 time 
slots respectively. Similarly, transmitter B has NSI of links li, 
I 2 (a link emanating from transmitter B), and (3 with delays of 
1, 0, and 2 time slots respectively, and transmitter C has NSI 
of links li, I 2 , and (3 (a link emanating from transmitter C) 
with delays of 3, 4, and 0 time slots respectively. Analogously, 
transmitters A, B and C possess the NSI of link li with delays 
of 0, 1, and 3 time slots respectively, of link I 2 with delays of 

^We assume that {■yi}i£C are such that {yici,... ,'yiCM}leC for 7; S 
[0,1] are all integers. 

network is said to have the complete interference property if each link 
in the network is in the interference set of all other links in the network. 


2, 0, and 4 time slots respectively, and of link I 3 with delays 
of 1, 2, and 0 time slots respectively. Additionally, the entire 
table of time-invariant delay values is assumed to be available 
at all transmitters in the network. 

We will need some notations. Let Ti{h) denote the time- 
invariant delay with which the NSI of link h is available 
at the transmitter node of link 1. For example, referring to 
the (2,3) entry of Table Tisih) is 4 . Let Tmax denote 
the maximum of the delay values across the whole network, 
i.e., Tmax ■= ^axi,h(^c,i^hTi{h). Also, let n^max denote the 
maximum of the delays with which the NSI of link I are 
available at the transmitter node of any link in the network, 
i-e., Ti^max ■= tnaXhec,i^hTh{l)- In other words, referring 
to the table of delay values, Tmax refers to the maximum 
of all the entries in the table {Tmax = 4 in Table [I]l, and 
Tii,max'^h G T refers to the maximum of the entries in row 
i in,,max = 3, Ti^^max = 4 , and Ti^^max = 2 in Table |^. We 
note that r/(() = 0 V( € £ in our model (the diagonal entries 
in Table [I]i. 

Let : r) := {Ci[t - T],Ci[t - t-G 1],. ■ ■ 

C[f](0 ■ fmax) ■— ■ 't'max^^lGC and C[f](0 

T^niax) ■■= : Ti^max)}lGC- Let Q[f](0 : Tmax) and 

Q[f](0 : T max) be similarly defined. We denote the NSI 
available at the transmitter node of link I by {7^i(Q[f](0 : 
T,max)),'Pi{C[t]{0 : T^max))}, where 

Vi{Q[t]{0: {P/m(QM(0 

VlrniQim ■ r.,max)) := {Qm[t - 

Vl{C[t]{0 : T.^max)) ■= {Vlm{C[t]{Q T.^max))}m(iC, and 
Vlm{C[t]{0 : T^max)) := {Cm[t - 

( 2 ) 


Example 1\ With reference to the delay values in Table [T] the 
NSI available at transmitter A (the transmitter node of link Zi), 
{'Pzi(QM(0 : r.^max)), (c[f](0 : r.^rnax))} is the set: {{Qi^[t- 

3],... ,CiAt],CiAt - 4], ■ ■ ■ ,Cift - 2],Cift - 2],Cift - 1]}}. 


Note that, in 0 , the NSI available at link I is defined 
as {Vi{Q[t]i0 viicm 

'^max ))}, and as a 

consequence, even though the structure of NSI that we use 
in our work is the same as that in 0 , the NSI at link I in our 
model is a subset of the corresponding NSI used in ||T| since 

for each /, Ti max — t'max- 

It is crucial to note that {Qi[t - Ti^max],Ci[t - Ti^max\}i(^c 
are all common information available at the transmit¬ 
ter node of each link in the network, as are {Qi[t — 
Ci[t-Tmax]}i(iC- It is intuitively appealing that {Qi[t- 
'^l,max\ 1 Ci[t — Ti^maxWi^c, being less delayed (i.e., more 
recent) compared to {Qi[t-Tmax],Ci[t-Tmax\}i(^c, and also 
being commonly available at all transmitters in the network, 
it is preferable that each transmitter in the network base its 
transmit/no-transmit decision on this information rather than 
on {Qi[t - Tmax],Ci[t - Tmax]}i&c- In Sec. 1^ we show that 
this intuition is indeed correct. 
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E. Performance Metric 

The metric of interest to us is the throughput reg/of£] (except 
in Sec. |VII| where we are interested in the saturated system 
throughpuj^. For this, we define the state of the network at 
time t as the process Y[t] := {Qi[t]{Q : Ti^rnax),Ci[t](0 : 
Ti,max)}iec- We denote Y[f] under the scheduling policjH 
by It is easily seen that this process is a DTMC. 

Given an arrival rate vector {Ai}/g£, we say that the 
network is stochastically stable under the scheduling policy IF 
if the DTMC Y'^ [f] is positive recurrent. We say that an arrival 
rate vector is supportable if some scheduling policy 

makes the network stochastically stable for this arrival rate 
vector. We say that a scheduling policy is throughput optimal 
if it stabilizes the network for any arrival rate vector that is 
supportablep^ 


F. Characterization of System Throughput Region 

Consider the collection of functions {/;}/££, where /; : 
'Pi(C[t](0 : T^rnax)) {0,1} has the following semantics 
- in each time slot t, the transmitter associated with each 
link I, computes the binary value //(7^;(C[f](0 : r^max))) and 
attempts to transmit on link I if and only if the outcome is 1 . 
Note that the outcome of fi is independent of all queue-length 
information. 

Let ^((Cjf) be the expected data transmission rate (i.e. the 
expected number of bits or packets transmitted per time-slot) at 
time t that the transmitter associated with link I would receive 
if it applies the scheduling policy fi (where f := {fi}i£c) 
when the delayed CSI at time f is C[f — r^max] = c. That is, 


^;(c,f) = E 


Ci[t]fi{Vi{.)){yi + {l-^i) 






(3) 


where C[t - r.^max] ■= {Ci[t - Ti^max]}iGC, and Vi{.) := 
'Pi{C[t]{0 : T.^max))- Let S(c,f) = {5';(c,f)}ig£. Also let 


the threshold function vectors f. Let A C as defined in 
Equation be our candidate for the throughput region of 
the system. 

A = (A : A = ^ 7r(c)a:(c), a;(c) G ? 7 (c)} (5) 

cSC^ 

III. Brief Description of the R Policy 

To put our work (which we develop starting in Sec. |IV| l in 
context, we briefly describe the R policy proposed in and 
investigate its two drawbacks in this section. For the R policy, 
given C[f](0 : T^ax), the critical NSI of the network at time 
t, is defined aj^ 

C5^(C[f](0 : Tmax)) '■= {{Cl[t — Tk{l)]}k£C,k=y^l}l&C^ ( 6 ) 

and the critical NSI available at the transmitter node of link I 
as 

(C[f](0 : Tmax)) := C5'^(C[f](0 : w)) 

n V[^{C[t]{0 : Tmax)), where 

(7) 

: Tmax)) := {V^icmo : Tmax))}m^CFnd 
'^;m(C[f](0 : Tmax)) ■= {Cm[t “ t]^ t^e analogous to 

the corresponding definitions in Eq. ©■ 

In |T|, the authors proposed a distributed scheduling policy 
for networks with heterogeneously delayed NSI. We reproduce 
their algorithm verbatim here. The algorithm has two steps. At 
each time slot, 

• Step 1: all the transmitters compute threshold functions 
based on common NSI available at all transmitters. These 
threshold functions, one for each transmitter, map the re¬ 
spective transmitter’s critical NSI to a corresponding thresh¬ 
old value, and are computed by solving the following 
optimization problem: 

argmax Qi[t - '^max ]7?Lx_(T), (8) 

T 

IGC 

where 


p(c) =CHf(S(c,f)), (4) 

where C'Hf{.) is the convex hull taken over all the threshold 
function vectors f. We note that 77 (c) C is the convex 
hull of the expected data transmission rates at time t of all 
links in the network when C[t — T.^max] = c, taken over all 


i?/.r(T):= E 


Cj M l{c,[t]>T,(.)}(7i + (1 - li) 


n i{c™[t]<T„(.)} 

m^Ii 


C[t-T] 


(9) 


^Throughput region (also called stability region) is the set of all supportable 
(defined in the next paragraph) mean arrival rate vectors. 

^Saturated system throughput is the sum of the throughputs on each link 
in the network when the queues at the transmitter node of each link have an 
infinite number of packets backlogged. 

^For each link I, a scheduling policy is a map from the NSI available at 
the transmitter node of link I, {■Pi(Q[t](0 : r,^raax)),'Pi{C[t]{0 : r^max))} 
to a transmit/no-transmit scheduling decision for (the transmitter associated 
with) link /. 

^^It is inherent in the definition of throughput optimality that a policy that is 
throughput optimal need only stabilize the network for any arrival rate vector 
that is supportable by any policy that uses the same information structure. For 
example, it is possible that a policy that has access to only delayed NSI (with 
a given structure) to be throughput optimal and yet not support an arrival rate 
vector that is supportable by a policy that has access to instantaneous NSI for 
all links in the network. 


and Tif) := Ti{CSl^{C[t]{0 : Tmax)))- 

• Step 2: each transmitter observes its current critical NSI, 
evaluates its threshold function (found in Step 1) at this 
critical NSI, and attempts to transmit if and only if its current 
channel rate exceeds the threshold value, i.e., when Ci[t] > 
Ti{CSl^iC[t]iO : Tmax))). 

To briefly illustrate the key idea behind the R policy and the 
difficulty that the problem of scheduling with heterogeneously 
delayed NSI entails, consider a network with three links - 

* * We remark that all the quantities that pertain to the R policy that we show 
with a superscript R (e.g., CS^{.), appear without superscripts in 
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h, h and I 3 with perfect collision and complete interference 
(illustration is easier for this setting). First, let us consider the 
case where the transmitter node of each link has the r-unit 
delayed NSI of each link (the case of homogeneous delays), 
and no link possesses the NSI of any link (including that of 
itself) with a delay lesser than r units (this of course does not 
conform to the heterogeneously delayed NSI setting). In this 
case, the R policy can be thought of as optimally partitioning 
the space of all sample paths (so as to maximize the achieved 
system throughput) into L partitions (where some partitions 
are possibly empty) with the connotation that the ith partition 
contains sample paths for which link li will carry transmission 
when any of these sample paths is realized. To see this, con¬ 
sider the 3-tuple (Ci^ — Ci^ [t — T], Ci^ [t — T]) which give 
rise to eight possibilities when C — {1, 2}. We can think of all 
the sample paths as grouped into eight classes corresponding 
to these eight possibilities of (C/^ [t — r], Ci^ [t — T], [t — r]). 
These eight classes are then partitioned optimally into three 
partitions such that when any sample path in the first partition 
is realized, link li will carry transmission (and links I 2 and (3 
won’t), and so on. When a particular sample path is realized, 
the transmitter nodes of links li, I 2 and Z 3 look at the tuple 
(C/j^ [t — r], Ci^ \t — t], Ci,j [t — r]) and determine the partition 
to which it belongs and thereby decide which of the three 
links will carry transmission. Now, this is true when the delays 
are homogeneous. In the case of heterogeneous delays, the 
problem of partitioning the sample paths is slightly more 
complicated by the fact that the different transmitters in the 
network have disparate views of the state of the network. We 
continue this illustration for the heterogeneously delayed NSI 
setting after introducing the example in Sec. |III-A| 

A. Computational Complexity 

From Expr. 0 , we see that the domain of optimization is 
the set of threshold function vectors T, where T = {Ti{.)}i^c- 
From Expr. 0 we see that, while computing the conditional 
expectation, given a threshold function vector T, each sam¬ 
ple path in the evaluation of the conditional expectation is 
subjected to evaluation by the threshold functions {Ti(.)}/G£. 
As a consequence, there are two aspects to the computational 
complexity of the R policy - namely, (i) functional evaluation 
complexity - the number of threshold function vectors T 
over which the optimization is to be done, and (ii) sample- 
path complexity - the number of sample paths that are to be 
considered in evaluating the conditional expectation. We first 
consider the complexity of functional evaluation and obtain 
an expression for the number of threshold function vectors 
required in the domain of optimization in the general case. 
Next, we characterize the structure on the delay values (in the 
table of delay values) that produces the worst-case functional 
evaluation complexity in the R policy. We then consider the 
complexity of functional evaluation in this worst-case setting, 
and obtain an expression for the number of threshold function 
vectors required in the domain of optimization in the worst 
case. 

Proposition 1. For the R policy, the total number of threshold 
functions that are needed to be considered in the domain of op- 

n y' ^ ^ 

timization in Expr. (0, in general, is (C-f , where 


C is the number of channel states, C is the set of links in the 
network, and Vi := CSl^{C[t]{0 : Tmax))\{Ci,[t-Tmax]}heC 
is the set of input parameters to the threshold function 

We present a proof of this result in Appendix We 
illustrate this result in Example Next, we characterize the 
structure on the delay values that produces the worst-case 
functional evaluation complexity in the R policy. 

Proposition 2. The worst-case complexity of functional eval¬ 
uation in the R policy is realized when the delays in the 
table of delay values are all distinct, and the delay values 
at positions (i,j) in row i of the table of delay values, for 
j = 1,2,... ,i — l,i -\- 1,..., L appear in descending order, 
for all i. 

We relegate the proof of this result to Appendix We 
now characterize the functional evaluation complexity of the 
R policy for the worst-case setting noted in Proposition 

Proposition 3. For the R policy, the total number of threshold 
functions that are needed to be considered in the domain of 
optimization in Expr. dSl, for the worst-case scenario noted in 

rn qHL-2) + {L-1) 

Proposition is (C -\- , where C is the 

number of channel states and L is the number of links in the 
network. 

Erom this result, the number of threshold functions required 
in the domain of optimization in Expr. 0 is doubly exponen¬ 
tial in the number of links in the network, and exponential 
in the number of levels into which the channel states on 
the wireless links are quantized. We present a proof of this 
proposition in Appendix Next, we characterize the sample- 
path complexity of the R policy. 

Proposition 4. For the R policy, the number of sample paths 
that are needed to be considered in the evaluation of the 
conditional expectation in Expr. 0 is given by where 

C is the number of channel states and L is the number of links 
in the network. 

Erom this result, the number of sample paths involved in 
the evaluation of the conditional expectation in Expr. 0 is 
exponential in the number of links in the network and in the 
maximum of the heterogeneous delay values. We present a 
proof of this proposition in Appendix |0 As noted earlier, 
given a threshold function vector T = {T;(.)}/g£, each of 
these sample paths is subjected to evaluation by the 

L threshold functions in T. We now illustrate the functional 
evaluation and sample-path complexities with an example. 

Example 2[^ Consider a three node network with three links - l\, 
I 2 , and I 3 with heterogeneous delays (t 2 )= 

S.rjg {i 2 )= 8 ,T,^ (i 3 )=i 2 ,T ;2 {i 3 )= 6 . Note that these delay values 
are already in the worst-case form of Proposition From these 
delays, we get CS,j (.)={Cij [t-li],C,^ [t-gj.Cij [t- 12 ]}, 

CS, 2 (.) = {Cij[t-ll],C,2[t-9],C,2[t-8].C,3[t-12],C,3[t-6]}, and 

CSi^{.) = {Ci^ [t-ll],C,3 [t-7],C,3 [t-9].C,3 [t-8],C,3 [t-12],Ci3 
Therefore, given {C;are functions of 
3,4 and 5 variables respectively (i.e., in the language of Proposition^ 

Reviewing the proof of Proposition in Appendix l^will be helpful in 
understanding this example fully. 
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Vi = 3 ,V 2 = 4, and V 3 = 5). Taking C = {lj2}, T;^(.) maps the 2^ 
values in its domain to the real numbers say 0.5, 1.5 and 2.5, independentlyp^ 
Thus, the number of choices of threshold functions for (.) is 3^. Similarly, 
the number of choices for Ti^{.) and Ti^{.) are 3^® and 3^^ respectively. 
Therefore, the total number of threshold function vectors T in the domain 
of optimizatiorp^ is 3^®. The number of sample paths to be considered in 
evaluating the conditional expectation is 2^® since Tmax = 12 and L = 3. □ 

Continuing the discussion on the key idea behind the R 
policy and the difficulty that the problem of scheduling with 
heterogeneously delayed NSI entails from the last paragraph 
in Sec. m for the case of heterogeneous delays in the 
above example, when a particular sample path is realized, 
the transmitter node of link li looks at T(C'S'/^(.)) (i.e., at 
T{Ci, [t - 11], Cl, [t - 7], Cl, [t - 9], Cl, [t - 12])) and decides 
whether link li will carry transmission depending on whether 

> T{CS(^{.)), the transmitter node of link I 2 looks at 
T{CSP^i.)) (i.e.,' at [t-11], Ci, [t-9],Ci, [t-8],Ci, [t- 
12], Ci,[t — 6 ])) to decide, and so on. Even though CS^{.), 
CSl^{.) and CSl^{.) do not coincide, it is easy to convince 
oneself that the sample paths will be partitioned into three 
partitions as before, since we are dealing with perfect collision 
interference (i.e., 7 / = 0, V/). 

B. The Twin DQIC Policies 

With the objective of isolating the undesirable consequences 
of using r,„£j 3 ;-delayed queue lengths (in comparison to using 
T.,max-delayed queue lengths) on the delay performance of the 
system, we consider two scheduling policies both of which 
have access to instantaneous CSI but differ (only) in the 
delays that they use to access the queue lengths. The first 
policy, DQICl (for Delayed Queue lengths and Instantaneous 
Channel states), uses Tmax-delayed queue lengths to stabilize 
the queues, whereas the second policy, DQIC2, uses t^ max- 
delayed queue lengths. Specifically, the two policies are; 

DQICl : argmax{Q/[f - Tmax] x 
DQIC2 : argmax max] x Ci[t]}^^^ 

Remark: Note that the only difference between the two policies 
is that the DQICl policy uses Tmax-d^loyed QSI whereas the 
DQIC2 policy uses t^ max-delayed QSI. Consequently, any 
difference in performance between the two policies is directly 
attributable to this difference in the delay values they use to 
access the QSI. 


C. Delay Performance - A Preview 

Towards developing a feel for what to expect in terms of 
the difference in delay performances of the different policies 
outlined in this section, we evaluate these policies through nu¬ 
merical simulation for the following specific instance of a two- 
transmitter network. We consider a network with two links li 
and I 2 that carry transmission from their respective transmitter 

^^See footnote [§ 

'^Incidentally, in this example, considering the delay values before rear¬ 
ranging as noted in Proposition the total number of threshold function 
vectors T in the domain of optimization is also 3®®. 


TABLE II: An instance heterogenous delay values for a wireless 
network with two links 



At TX At 

At TX s' 

Delay in obtaining NSI of link li 

0 

1 

Delay in obtaining NSI of link I2 

X 

0 


^ A, B are transmitter nodes of links l\ and I2 respectively. 


nodes A and B to their respective destinations (with Ii, = {I2} 
and Ii, = {^i}), with the channels on these links modeled 
as independent DTMCs on the state space C = {1,2} with 
crossover probability 0.1. We consider heterogeneous delays 
as noted in Table [H] where x > 1 is a parameter we vary. From 
this table, we see that Tmax — Pi,max — 1^ p 2 ,max — •T. 

The DQICl and DQIC2 policies specialize to this setting 
as follows: 

DQICl : argmax{Q;Jf-x] x Ci,[t], Qi,[t-x] x Ci,[t]} 
DQIC2 : argmaxjQiJf- 1] x Ci,]t], Qi,[t-x] x Ci,[tf\ 

i.e., DQICl schedules link li if Qi, [f — x] x Ci, [f] > Qi, [t — 
x] X Ci,[t], and link I2 otherwise, whereas DQIC2 schedules 
link li if Qi, [f — 1] x Ci, [f] > Qi, [t — x] x Ci, [f], and link I 2 
otherwise. 

Packets arrive into the two queues at the two transmitters 
as independent Poisson processes with rates Ai = A 2 = 7 (for 
this setting, it is easily seen that sum rates (i.e. A 1 -I-A 2 ) of up to 
1.75 are supportable p) Packets are time-stamped on arrival 
and on exit, and on servicing link I successfully, a number 
of packets equal to min(C';[f],Qi[f]) are removed from the 
queue of link 1 . Fig. [TJa) depicts the average queueing delay 
per-packet (in units of time-slots) of the DQICl, DQIC2, R 
and H policies. Fig. [^b) depicts the percentage reduction in 
the average queueing delay per-packet in the DQIC2 policy 
compared to that of the DQICl policy. 

Conclusion; The average per-packet queueing delays of the 
DQICl and R policies grow linearly with Tmax whereas that 
of the DQIC2 policy tends to become almost impervious 
to increase in Tmax- Comparing the DQICl, DQIC2 and 
R policies and their delay performances, the fact that the 
DQICl and R policies use Tmax-delayed queue lengths 
isolates itself as the sole reason for their undesirable delay 
performances. 


IV. The H Policy 


In Sec. II-D 


_ we conjectured that {Ci[t - n^maxjjiGC. 

being less delayed compared to {Ci[t — rmax]}iGCy and being 
commonly available at all transmitters in the network, it may 
be preferable that each transmitter base its transmit/no-transmit 
decision on this information rather than on {Ci[t — Tmax]}iGC- 
In addition to testing this hypothesis, we wish to reduce the 
computational complexity and the average per-packet queueing 
delay in comparison with that of the R policy. Motivated by 


Considering the four possibilities of data rates for Ci^ [f] and Ci^ [f], a 
data rate of 1 unit is supportable when = Ci^[t] = 1 which happens 

with steady-state probability 0.25, and a data rate of 2 units are supportable 
in all the other three cases since when the queues are saturated, both the 
DQICl and DQIC2 policies would choose the link with the largest data 
rate. These three cases happen with steady-state probability 0.25 each. 

^^The LC-ELDR and LC-ERDMC policies are identical for a network 
with only two links (see Algorithm ^ in Sec. |VI[ l. 
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(a) 



Fig. 1: (a) Comparison of average queueing delay per-packet in the DQICl, 
DQIC2, R, and H policies. The R and DQICl policies use r^aa^-delayed 
QSI whereas the H and DQIC2 policies use r.,max-delayed QSI. The 
average per-packet queueing delays of the DQICl and R policies grow 
linearly with Tmax whereas that of the DQIC2 policy tends to become 
almost unaffected by increase in Tmax- Comparing the DQICl, DQIC2 
and R policies and their delay performances, the fact that the DQICl and 
R policies use Tmax-delayed queue lengths isolates itself as the sole reason 
for their undesirable delay performances. The (relatively) small difference in 
the queueing delay performances of the H and DQIC2 policies is due to 
the fact that the DQIC2 policy has access to instantaneous CSI whereas the 
H policy only has access to delayed CSI. (b) Percentage reduction in the 
average queueing delay per-packet in the DQIC2 policy compared to that 
in the DQICl policy. 


these, we define the H policy to be the following scheduling 
policy. In each time slot, 


• Step 1: using the common NSI available at all transmitters, 
each transmitter determines the optimal threshold function 
vector, by solving the following optimization problem: 

argmax Qi[t - n ,max ]-Ri.r_(T), (10) 

T ^^ 

where 


Rlt (T) :=E 


Ci [i] l{Ci[t]>T,(.)} ( 7 ; + (1 - li) 

m^Il 


( 11 ) 


C[t- ] := {Ci[t - n 

,max ]}ZG£. 

T :C5i(C[<](0:r.,™a.))^K, 

C5i(C[i](0 : T.^^ax)) ■■= C5(C[t](0 : T.^ma.)) 

nVi{C[t]{0:T^rna.)), ’ 

and C5(C[i](0 : r^max)) ■= {{Ci[t - Tk{l)]}k&c,k^i)i<^C- 

(13) 

• Step 2: each transmitter observes its current critical NSI, 
evaluates its threshold function (found in Step 1) at this 
critical NSI, and attempts to transmit if and only if its current 
channel rate exceeds the threshold value, i.e., when Ci\t\ > 
T(C5i(C[f](0 : T^max))). 

Example 3: Continuing Example ^ for the H policy, the 
critical NSI of the network C5(C[f](0 : r.^max)) is the set 

{Cz, [t - 1], Cz, [t - 3], Q, [t - 2], Cl, [f - 4], Cz 3 [t - 1], Cz 3 [t - 2]} and 
the critical NSI available at the transmitter nodes of links liR 2 ,l 3 , namely 
CSi,{Clt](0 : 

'r’.jmax)); C5,,(C[t](0 : T.^max ) )) C5i3(C[t](0 ■ T^,7nax')')i 
respectively are {CjJt - - 

and {Ci, [t - 3], [t - 4], ^3 [t - l],Ci^ [t - 2]}. □ 


Remark: It is to be noted that the R and H policies differ only 
in the delay values that they use to access queue lengths and 
channel state information, with the R policy using the delay 
value Tmax o-kid the H policy using the delay value Ti^max to 
access the NSI of link 1. 


A. Computational Complexity 


In this section, similar to the way that we dealt with the 


R policy in Sec. III-A we first consider the complexity 
of functional evaluation and obtain an expression for the 
number of threshold function vectors required in the domain 
of optimization in the general case. Next, we characterize the 
structure on the delay values (in the table of delay values) 
that produces the worst-case functional evaluation complexity 
in the H policy. We then consider the complexity of functional 
evaluation in this worst-case setting, and obtain an expression 
for the number of threshold function vectors required in the 
domain of optimization in the worst case. 


Proposition 5. For the H policy, the total number of threshold 
functions that are nee ded to be considered in the domain of 

v c I Wi I 

optimization in Expr. \10\ , in general, is (C + 
where C is the number of channel states, C is the set of links 
in the network, and Wi ■= C'S'/;(C[f](0 : r^max)) \ {Cii[t — 
Tii,max]}iieC is the set of input parameters to the threshold 
function Tiff). 

We present a proof of this result in Appendix We 
illustrate this result in Example Next, we present the 
following result that will be required later; 

Proposition 6. C'5'f(C[f](0 : Tmax)) = CSi{C[f\{0 : 

T. ,max ) )• 

We provide a proof of this proposition in Appendix We 
next characterize the structure on the delay values that brings 
































out the worst-case functional evaluation complexity in the H 
policy. 

Proposition 7. The worst-case complexity of functional eval¬ 
uation in the H policy is realized when the delays in the 
table of delay values are all distinct, and the delay values 
at positions {i,j) in row i of the table of delay values, for 
j = 1, 2,..., i — 1, i -f 1,..., L appear in descending order, 
for all i. 

We highlight a minor difference in the proof of this propo¬ 
sition compared to that of Proposition in Appendix We 
now characterize the functional evaluation complexity of the 
H policy for the worst-case setting noted in Proposition |7] 

Proposition 8. For the H policy, the total number of threshold 
functions that are needed to be considered in the domain of 
optimization in Expr. for the worst-case scenario noted 
in Propositiot^^ is ^ , where C is the number 

of channel states and L is the number of links in the network. 

We present a proof of this result in Appendix Next, we 
characterize the sample-path complexity of the H policy. 


Proposition 9. For the FI policy, the number of sample paths 
that are needed to be conside red in the evaluation of the 
conditional expectation in Expr. {11 1 is given by 
where C is the number of channel states. 


The proof of this proposition is similar to the proof of 
Proposition and is therefore omitted. We now illustrate with 
an example the reduction in computational complexity that the 
H policy achieves over that of the R policy. 

Example 4\ Consider the same network as in Example]^ After computing 
CSij^(.), CSi.^{.) and CSi^{.) using Eq. jl2| , we see that given {Cft — 
ri,max]}leC’ Tli(fTl 2 (-)’F 3 {-) are functions of 1,2 and 3 variables 
respectively. Using the same arguments as in the proofs of Propositions 
and|^ in Appendices and [h| the number of choices of threshold functions 

for (.), T ^2 (■) respectively. Therefore, the 

total number of threshold function vectors T in the domain of optimization is 
3^"^ (the R policy required 3^^ threshold functions; see Example]^ and the 
number of sample paths required to be considered in evaluating the conditional 
expectation is 2^^ (the R policy required considering 2^^ sample paths; 
see Example]^, yielding an enormously massive reduction in computational 
complexity over that of the R policy. We note that this vast reduction is mainly 
due to the fact that for the H policy, the exponent in the double exponential 
in the expression in Proposition]^ is smaller than that of the R policy. 


B. Delay Performance 

Fig. [^shows the average per-packet queueing delay (in units 
of time-slots) in the H policy for the setting of the example 
in Sec. |III-C| The average per-packet queueing delay of the 
R policy grows linearly with T^ax whereas that of the H 
policy tends to flatten out. Comparing the R, H, DQICl and 
DQIC2 policies and their delay performances, it is clearly 
evident that the use of r ^aa;-delayed queue lengths in the 
H and DQIC2 polices is what gives these policies their 
better delay performances in comparison to those of the R 
and DQICl policies which use r^ax-delayed queue lengths. 


C. Throughput Optimality 

In this section we show that, like the R policy, the H 
policy too is throughput optimal. First, we prove that if 
an arrival process is supportable, then the expected arrival 
rate of this process should lie within the system throughput 
region A defined in Sec. II-F This would then mean that the 
system throughput region A is the region that encompasses 
all supportable arrival rates given the NSI structure in Sec. 
II-D| Next, we show that the H policy stabilizes all arrival 


rate vectors in the system throughput region A. These two 
together would then imply that the H policy is throughput 
optimal. 


Lemma 1. Under the NSI structure noted in Sec. if the 
traffic arrival process {A[f]}t is supportable, then € 

A. 


The proof of this lemma, which is similar to the proof of 
Lemma 4.1 in |[T|, is available in Appendix |I] 

Corollary 1. The system throughput regions defined in Equa¬ 
tion (0. and in Sec. 4.1 in 0. are identical. 

This is an immediate consequence of Lemma 4.1 in |[T) and 
Lemma [T] stated above. See Sec. 4.1 in |[T) for the definition 
of the system throughput region considered in Q. 

Theorem 1. The H policy is throughput optimal. 

The proof of this theorem, which is similar to the proof 
of the corresponding theorem for the R policy in ||T] (but 
with significantly more details) is available in Appendix 
An important implication of this theorem is that using Tpmax- 
delayed NSI instead of r^ax-delayed NSI (for each link 1) 
does not harm the achievable system throughput, a fact that 
we exploit crucially in designing our computationally efficient 
near-throughput-optimal policies in Sec. 


V. Analytical Characterization oe Delay 
Performance 

In this section, we characterize the average queueing 
delay per packet in the DQICl, DQIC2, and R policies 
analytically. To do this, we first fix a time-horizon, consider 
all possible packet arrival patterns and all possible channel 
states in each slot within this time horizon and compute the 
average delay experienced by a packet in each of the above 
scheduling policies in the limit as the time-horizon tends to 
infinity. We need the following additional notation: 


T := Time horizon (i.e, the number of time slots 
over which we measure the average delay 
per packet) 


Ni := Number of packets that have been serviced 
from time t = 0 to t = T from the queue 
associated with link li 
c 

N := Ni is the total number of packets in the system 
i=i that have been serviced from time f = 0 to 
t = T from the queue associated with any 
link in the network 
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aij ■■= 


= 


The arrival time of the 
jth packet from the head 
of line in the queue asso¬ 
ciated with link I 


if j < Qi[t] 

otherwise 



/ 1 

if a is true 

!{.} 

l{a} := 

1 0 

otherwise 


{a,b} ■= 

l{a} X 1 

{b} 


x'^ := 

max{0, x\ 

For the 


T-l 




arrivals,CSI 


E Ei{} 


2 = 1 


Ci,[t] 

E 

i=i 


!{.} := 


Qi^ [t] xCi Jt] > max{Qij_ [tJxCij, [t]}| if t<0 

^Qii [t-TT7iax]xCi. [i] > max{(3ij, [t]}, 

Qi; [t-TmaxJxCi Ji] > max{(3i^ [t-T„oj;]xC(j, [t]} I if t>0 
V A:>i J 


With these definitions in place, and with the assumption that 
the queues are purged of the packets that were transmitted in 
a particular time slot at the end of that time slot, we are now 
ready to characterize the average per-packet queueing delay in 
these policies. First, we characterize Darrivais,csi - the long¬ 
term average queueing delay conditioned on the knowledge of 
the channel states of all links in the network for all time slots 
(i.e., given {Ci[t]}iec, for all t such that -r^ax <t< T-l) 
and conditioned on the availability of the actual arrivals at all 
links in the network for all time slots (i.e., for all 

t such that —Tmax < t < T — 1) 3 lS follows: 


TV’ 

(14) 

where the indicator function in Eq. indicates the 

condition under which link li is scheduled for transmission 
in time slot t, and is is defined, for the various policies, as 
followsP^ 

For the DQICl Policy: 

If 

IQi- [t]y.Ci. [t] > max{Q;j, [t]xCi^, [t]}, 


For the DQIC2 Policy: 

If 

iQi.lt]xCi.lt]> max{Q;j, [t]xCi^, [t]}. 

Qlilt]xCi.lt]>niax{Qii^[t]xCi^[t]}'^ if t<0 
[t-Ti,max]xCl; [t] > max{Qi^ lt-Tl^rn.a^]xCli, [i]}, 
Qlilt-Ti^-^a.a;]xCl.[i\>-max{Qii^[t-Tl^rna,^]xCli^[t]}'^ if t>0 

' ^{-l •“ ^{CiJt]>r,.(cS«(C[t](0:r„„,)))} 

For the H Policy: 1„ := l{c,p]>T,(cs;(c[*]( 0 :..,„„.)))} 

Note that the knowledge of the number of packet arrivals 
into the queue associated with link in each time slot t, for 
—Tmax < f < T — 1 is subsumed in the ai.j term and the 
knowledge of the channel state on link li in each time slot t, 
for —Tmax < t < T — 1 IS accounted for in the fact that the 
queues are purged of the packets transmitted in a time slot at 
the end of that time slot, which in turn affects the ai-j term. 
Also note that any packets remaining in the queues after time 
slot T — 1 are not serviced and hence do not contribute to the 
average queueing delay. 

Now, removing the conditioning on knowledge of CSI, we 
get the expression for Darrivais - the average per-packet 
queueing delay (conditioned only on the arrivals) as follows: 


Dn 


vals — 


E 


D. 


^ ■' t = — Tma. 


arrivals,CSI j 
T —1 m—1 


(Cm[ I~max]} 


T-2 

n 


Pcm Cm[n + 1] 


(15) 

where 7r(c) is the steady-state probability of being in state 
c in the channel-state Markov chain, and pij is the one-step 
transition probability from state i to state j in the channel-state 
Markov chain. 

Finally, removing the conditioning on the knowledge of 
arrivals on each link in each time slot t, —Tmax ^ t < T — 1, 
we get the expression for D - the required average per-packet 
queueing delay as follows: 


In the indicator function in the expressions above, we 
note that the splitting of the comparison of the product of 
queue-length and channel state on link li with that on the 
other links, into two - namely, (i) comparison with that 
on links lk,k < i, and (ii) comparison with that on links 
lk,k > i, creates a lexicographic ordering among the links. 
This is required to consistently resolve the “winner” in case 
there is a tie in the queue-length channel-state product on 
multiple links - we always resolve in favor of the smallest i 
(as a convention) in case of a tie. 


Note that the time index t starts from —Tmax in Eq. 0 since at time 
t = 0, Tmax-delayed NSI would be the NSI at time t = —Tmax- Note 
that Ni (and hence N) does not include the packets serviced in time slots 
t = — Tmax to f = —1. 


c / T-l \ 

D ■■= ^ Darmvals Y[i (A; [f] = O; [f]) j 

r, , ,, I T-l 1 = 1 \t = -T„^aa: / 

^ ^ 't = — 'rmax 

(16) 

A. Computational Complexity 

Since Expr. is not in closed form, we will have to 
resort to numerical evaluation of this expression for a fixed 
finite value of T. For a fixed finite T, the number of time slots 
over which Expr. needs to be evaluated is Tmax + T. At 
each link in the network, the number of packet arrivals in each 
time slot can be any of the values in the set {0,1,..., Amax}, 
where Amax is the maximum number of packets that can 
arrive in each time slot (see Sec. |II-C|l. Thus, the number 
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of possible arrival streams at a link (over all the r^ax + T 
time slots), is given by {Amax + and therefore 

the number of possible arrival streams taking all the links 
together is {A^ax + Further, in case of Markov 

chains where any state can be reached from any other state 
in one step (as in the setting considered in Sec. VII-Cl, the 
channel state on each link can be any of the C states, where 
C is the number of states in the channel Markov chain. Thus, 
for each possible arrival stream pattern at each link, over the 
fixed finite time horizon T, the number of possible channel 
state transition patterns at a link is given by 
and therefore the number of possible channel state transition 
patterns taking all the links together is Thus, 

using Eq. ®, the computational complexity of comprehen¬ 
sively evaluating the average queueing delay per-packet over 
a fixed finite T is {Amax + x or 

{{Amax + operations. Thus, for example, in 

a network with 2 transmitters, where in each time slot, the 
number of packet arrivals can only be 0 or 1 , and where 
the channel state Markov chain has only 2 states, for a time 
horizon of T = 10, and for r^ax = 2, the computational 
complexity is 2 ^® (or, roughly 10 ^"^) operations. 


VI. Low Complexity Scheduling Policies 


The H policy, despite the immense reduction in computa¬ 
tional complexity that it achieves in comparison to the R pol¬ 
icy, is still computationally complex, and therefore impractical. 
In this section, we propose and evaluate two fast and near- 
throughput-optimal scheduling policies - namely, LC-ELDR 
(for Eliminate link with Least Data Rate), and LC-ERDMC 
(for Eliminate link that Reduces Delays for Maximum number 
of Channels) - for the heterogeneously delayed NSI setting, 
confining our attention to the more pragmatic case of perfect 
collision interference (i.e., 7 ; = 0, VZ G £). Initially, we con¬ 
sider the case of complete interference (i.e., R = C \ {Z} VZ G 
£), and consider extension to the case of multiple interference 
sets subsequently. These low-complexity scheduling policies 
derive their main idea from the H policy; the idea being that 
the common delay value that all the contending transmitters 
(i.e., transmitters corresponding to contending links) can use 
to access the NSI of some link Z (of at least one link), could be 
reduced (and hence more recent [i.e., more reliable] delayed 
link-statistics could be made use of in computing the schedule) 
by carefully choosing to eliminate one link from among the 
contending links. These policies are iterative (we call each 
iteration, a “round”) and they operate by choosing to eliminate 
one link in each round; the two policies differ only in the 
criterion they use for selecting the link that they eliminate 
in each round. Further, eliminating one link in each round 
accords these policies polynomial running times as we show 


in Sec. VI-C We list the LC-ELDR policy in Algorithm 
[T] and demonstrate its working in detail in Sec. VI-A The 
LC-ERDMC policy is identical to the LC-ELDR policy 
listed in Algorithm [T] except for step 16 which is listed 
separately in Algorithm We derive analytical expressions 
for the exact expected saturated system throughputs of the 


LC-ELDR and LC-ERDMC policies in Sec. VI-B and plot 


TABLE III: Heterogeneous delays for the illustration in Sec. 


VI-A 



At TX 

At TX 

At TX 

At TX 


A 

B 

c 

D 

Delay in obtaining NSI of link li 

0 

4 

1 

1 

Delay in obtaining NSI of link I2 

1 

0 

1 

2 

Delay in obtaining NSI of link Z3 

1 

1 

0 

5 

Delay in obtaining NSI of link ^4 

3 

1 

1 

0 


these expressions in Sec. VII and show that these policies are 
near-optimal. 


A. Dynamics of the LC-ELDR Policy 

We now demonstrate the working of the LC-ELDR policy 
for a network with four wireless links - Zi, Z2, Z3, Z4 (with A, 

B, C, D as the transmitter nodes on these links, respectively), 
all contending for transmission in the current time slot. We 
will assume that the queues at the transmitter node of these 
links are saturated, and that the heterogeneous delays are as 
in Table [111] Let each of the wireless links be modeled as a 
Markov chain on the state space C = {1,2} with crossover 
probability 0.1 (the case of very slow varying channel). The 
n-step transition probability matrices for this Markov chain 
are as below (shown for only those delays [n-values] that are 
required in our computation); 


p(i) = 

■ 0.9 0.1 ■ 

0.1 0.9 J ’ 

p(2) _ 

0.82 0.18 ■ 

0.18 0.82 J ’ 

p(3) _ 

■ 0.756 0.244 ' 

0.244 0.756 . 

, = 

■ 0.7048 0.2952 ' 

. 0.2952 0.7048 . 

pO _ 

■ 0.6638 0.3362 

0.3362 0.6638 




We assume the following delayed channel-realizations for our 
illustration: Cijf — 4] =2, — 1] = 1, Ci^[t — 2] = 2, 

Cl, [Z - 1] = 1, C/3 [Z - 5] = 2, C/3 [Z - 1] = 2, Cu [Z - 3] = 2, 
Ci^[t — 1] = 1. We first illustrate the working of the 
LC-ELDR policy when it is executed at transmitter C, 
and towards the end we mention the minor differences in 
the dynamics when it is executed at the other transmitters. 
Since we are initially concerned with the case of complete 
interference, we set ActiveSet = (Zi, Z 2 , Z 3 , Z 4 } before 
invoking Algorithm where ActiveSet is the set of links that 
are (still) contending for transmission in the current time slot. 

(step 2) transmitter C sets I = {Z 3 } 

ROUND lEE 

(step 4) transmitter C computes Ti.^max, for i = 1,2, 3, 4, 
which results in n.^^ax = 4, n^^max = 2, n^^rnax = 5, 
Tii,max = 3. This is illustrated in the last column of the table 
in Fig. I^a) 

(step 5) transmitter C computes the expected data rate on 
each link Z/ in ActiveSet given the -delayed channel- 

state realization of that link, as follow sj^ E[CiAt]\CiAt- 

^^the node where this algorithm is being run 

*®We will call the “while” loop body from line 3 to line 23, and also the 
computation in lines 24 — 29 in Algorithm 1, a “round”. 

*^we ignore the multiplication of the delayed queue-lengths with the ex¬ 
pected data rates since we are considering saturated queues for this illustration. 
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Fig. 2: Illustration of the dynamics of the LC-ELDR policy. In the beginning, ActiveSet = Subfigure (a) shows the original table of delay 

values and values computed in step 4 of Algorithm^in round 1. Subfigures (b), (c), (d) show the computation of the set EC in round 1 (step 7). In 

subfigure (b), after setting aside the link with the largest expected data rate (link I 2 ; shown by highlighting in green the column of transmitter B [the transmitter 
node of link I 2 J), we see whether eliminating link li reduces the ti -max delays for any other link in ActiveSet. This is done by temporarily masking the row 
and column pertaining to link /i (shown hatched), recomputing the ri^max delays and comparing with previously computed values. The value of ri^^^max 
reduces (from 3 to 1), and hence link is a candidate for elimination, and therefore belongs to EC. Similarly, in subfigure (c), we determine that link 
does not belong to EC, and in subfigure (d), we determine that link ^ belongs to EC. In subfigure (e), we eliminate link li (shown by striking-through in 
red the row and column pertaining to link li) since only li and I4, belong to EC and li has smaller expected data rate than I4 (steps 16, 21). This reduces 
ActiveSet to {/ 2 , ^ 3 ,/ 4 }.Subfigures (f), (g), (h) show similar calculations for round 2, at the end of which I4 is eliminated, leaving behind only I2 and in 
ActiveSet. 


4] = 2] = 1.7048, E[Ci^[t]\Ci^[t - 2 ] = 2] = 1.82, 
E[Ci,[t] I Ci,[t - 5] = 2] = 1.6638, E[Ci,[t] \ Ci,[t - 3] = 
2] = 1.756 

(step 6) transmitter C sets aside link I 2 by setting H = I 2 , 
since link I 2 has the largest expected data rate in this round, 
as computed in the previous step (this is done so that the 
link with the largest expected data rate in this round is not 
eliminated). This is shown by highlighting transmitter i?’s {B 
is the transmitter node on link I 2 ) column in green in Fig. 

I3b)- 

(step 7) transmitter C, to decide on the link it wants to 
eliminate in this round, computes EC as follows: 

(i) first, transmitter C temporarily masks the row and column 
pertaining to link li (shown by hatching the correspond¬ 
ing row and column, in Fig. |^b)), computes the ti. ^max 
values for links li, i = 2,3,4 (i.e., for all links other 
than link ( 1 ), compares them to the previously computed 
Tii,max values for these links, finds that upon masking the 
row and column pertaining to link li, the r;. max value 
for at least one link (namely, that of link (4) reduces 
(from 3 to 1; see Fig. |^b)), and hence decides that link 
Zi is a candidate for elimination, and adds it to the set of 
elimination candidates, EC. What this means is that if 
transmitter C does choose to eliminate link ( 1 , each of the 
transmitters corresponding to the remaining links would 
then be able to access the delayed channel state of link 
Z 4 at a reduced common delay value of 1 unit (instead 
of 3 units, as was required previously) since the other 
transmitters assuredly possess the delayed channel-state 
of link (4 at this new reduced delay of 1 unit 

(ii) transmitter C skips this procedure for link I 2 since I 2 , 
being the link with the largest expected data rate, was set 
aside in the previous step 

(iii) next, transmitter C temporarily masks the row and col¬ 
umn pertaining to link (3 (i.e., its own link; shown again 
by hatching the row and column of link (3, in Fig. |^c)), 
computes the r;. max values for links li for i = 1,2,4 
(i.e., for all links other than link (3), compares them to 


the previously computed Ti.^max values for these links, 
finds that upon masking the row and column pertaining 
to link ( 3 , the r;. max value for none of the links li for 
i = 1,2,4 reduce, and hence decides that link I 3 (i.e., its 
own link) is not a candidate for elimination 

(iv) lastly, transmitter C temporarily masks the row and 
column pertaining to link Z 4 (shown by hatching the row 
and column of link ( 4 , in Fig.|^d)), computes the max 
values for links for i = 1, 2, 3 (i.e., for all links other 
than link ( 4 ), compares them to the previously computed 
Tii.max values for these links, finds that upon masking the 
row and column pertaining to link I 4 , the n^^max value for 
at least one link (namely, that of links I 2 and ( 3 ) reduces 
(from 2 to 1 for link I 2 , and from 5 to 1 for link ( 3 ; see 
Fig. I^d)), and hence decides that link I 4 is a candidate 
for elimination, and adds it to the set of elimination 
candidates, EC. This means that if transmitter C does 
choose to eliminate link I 4 , it is assured that each of the 
links remaining in ActiveSet would then be able to access 
the channel state of links I 2 and (3 at reduced delays of 
1 unit and 1 unit respectively (instead of 2 and 5 units 
respectively, as was required previously) 

Thus, transmitter C has computed the set of elimination 
candidates to be EC = {( 1 , ( 4 } 

(step 16) Among the elimination candidates in set EC, 
transmitter C chooses link li to be the one that will be 
eliminated in this round by setting S = li, since link li has 
the lowest expected data rate among li and I 4 (which have 
expected data rates of 1.7048 and 1.756, respectively; see step 
5 above). This is shown by striking out the row and column 
pertaining to link li in Fig. |^e) 

(step 21) Transmitter C removes link li from ActiveSet 
(since link li, having been eliminated in this round, is 
no longer in contention with the other links in ActiveSet 
for carrying transmission in the current time slot). Thus, 
ActiveSet = {( 2 , ^ 3 ,^ 4 } is the revised set of links still 
contending 
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Algorithm 1 

1: procedure LC-ELDR{ActiveSet) 

2: 1 ■(— all links in the network for which this nod^^is the 

transmitter 

3: while \ ActweSet\ > 2 do 

4: Compute Ti^max Vi € ActiveSet from the delay table after 

suppressing the rows and columns corresponding to links 
not in ActiveSet 

5: Compute the queue-length weighted expected data rate 

that will be realized if link I is allowed to carry trans¬ 
mission given the r;_maa;-delayed channel state of link I, 
for all links I in ActiveSet, as follows: Qi\t — n^rnax] x 
E[Ci[t] \ Ci[t- Ti^rnax] = , where is 

the channel-state realization of link I at time t — n^rnax 

6: Let H be the link with the largest queue-length weighted 

expected data rate computed in Step [^(arbitrarily chosen 
if more than one satisfy this criterion) 

7: Let EC (for Elimination Candidates) be the subset of 

ActiveSet\{H} such that if link iV G ActiveSet\{H} 
then link K G EC if, on recomputing the delays 
Ti,max V( G ActiveSet after masking the rows and 
columns corresponding to link K and the links not 
in ActiveSet from the table of delay values, there is 
a reduction in the n^rnax value of at least one link 
I G ActiveSet \ {K} 

8: if EC = (p (the empty set) then 

9: \f H £ I then 

10: Set transmit decision of H = TRANSMIT; For 

all / G / \ {fT}, set transmit decision of I = 
NoTransmit 

11: else 

12: For all I G I, set transmit decision of I = 

NoTransmit 

13: end if 

14: Exit. 

15: else 

16: Let S be the link in EC with the lowest queue- 

length weighted expected data rate computed in Step 
[^ (arbitrarily chosen if more than one satisfy this 
criterion) 

17: if S G 1 then 

18: Set transmit decision of S' = NoTransmit. 

19: if I = <j) then Exit. 

20: end if 

21: ActiveSet t— ActiveSet \ {S} 

22: end if 

23: end while 

24: Recompute the delays n^rnax as in Step [^ Recompute the 

queue-length weighted expected data rates as in Step[^ Let 
T G ActiveSet be the link with the largest expected data rate 
(arbitrarily chosen if more than one satisfy this criterion) 

25: if T G 1 then 

26: Set transmit decision of T = TRANSMIT. For all I G 

I \ {T}, set transmit decision of I = NoTransmit 

27: else 

28: For all I G I, set transmit decision of I = NoTransmit 

29: end if 

30: end procedure 


ROUND 2: 

Note: For all purposes, the table of delay values for this round 
is the original table minus the row and column corresponding 
to link li since it was eliminated in the previous round. 

(step 4) transmitter C sets Ti^^max = X (don’t care) since link 
li does not belong to ActiveSet anymore. Transmitter C then 
calculates the new values of n.^max, for i = 2,3,4. All these 
values could potentially be smaller than the corresponding 
values in round 1 , since when calculating these values in 
round 2, transmitter C ignores the delay values in the row 
and column pertaining to link li which was eliminated in the 
previous round. The new values are: Ti^^max = ‘2;Ti^,max = 
S,Ti^,max = 1- Note that n^^max has reduced from 3 (in the 
previous round) to 1 (see Fig. |^e)). 

(step 5) as in round 1, transmitter C computes the expected 
data rate on each link li in ActiveSet given the ti. max-delayed 
channel-state realization of that link (as computed in step 
4 above), as follows: E[Ci^[t]\Ci 2 [t — 2] = 2] = 1.82, 
E[Ci,[t] I Ci^[t - 5] = 2] = 1.6638, E[Ci,[t] \ Ci,[t - 1] = 
1 ] = 1.1 

(step 6) as in round 1, transmitter C sets aside the link that 
has the largest expected data rate in this round, as computed 
in the previous step. Coincidentally, link I2 turns out to be that 
link in this round also. Therefore, transmitter C sets El = 12- 
This is shown by highlighting transmitter B's column in green 
inFig.jgf). 

(step 7) once again, transmitter C goes about computing EC 
(to decide on the link it wants to eliminate in this round) as 
follows: 

(i) first, transmitter C decides to skip this procedure for link 
1 1 (since it is not in ActiveSet anymore), and for link I2 
(since I2, being the link with the largest expected data 
rate, was set aside in step 6 above) 

(ii) next, following similar arguments as in round 1 , trans¬ 
mitter C decides that link (3 (i.e., its own link) is not a 
candidate for elimination (see Fig. [^f)) 

(iii) lastly, again following similar arguments as in round 
1, transmitter C decides that link I4 is a candidate 
for elimination, and adds it to the set of elimination 
candidates, EC (see Fig. |^g)). 

Thus, transmitter C has computed the set of elimination 
candidates fo he EC = {li} 

(step 16) since there is only one elimination candidate 
(namely, link (4) in set EC, transmitter C chooses to eliminate 
link (4 by setting S = I 4 . This is shown by striking out the 
row and column pertaining to link I4 in Fig. |^h) 

(step 21) Transmitter C removes link I4 from ActiveSet 
(since link I4, having been eliminated in this round, is 
no longer in contention with the other links in ActiveSet 
for carrying transmission in the current time slot). Thus, 
ActiveSet = {^21 ^ 3 } is the revised set of links still contending 

ROUND 3: 

(step 24) transmitter C sets n-^^max and n^^max to X (don’t 
care) since links li and I4 do not belong to ActiveSet anymore. 
Transmitter C then calculates the new values of R.^max^ 
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for i = 2,3 to be 


= n. 


= 1. Transmit- Algorithm 2 


ter C computes the expected data rate on each link li in 
ActiveSet given the Ti. max-deiayed channel-state realization 
of that link (where the new Ti.^max value is as computed 
only just), as follows: E\Ci^[t\\Ci^\t — 1 ] = l] = 1 . 1 , 
[^^3 [^] I ^^3 — 1-9. Note that T/ 2 ,rnaa; and Ti^ Ynax 

have changed from 2 and 5 respectively (in the previous round) 
to 1 each (see Fig.j^h)). Transmitter C sets T = I 3 since link 
I 3 (i.e., its own link) has the largest expected data rate as 
computed above 

(step 26) Transmitter C sets the transmit decision for link (3 
(i.e, for its own link) to Transmit 


procedure LC-ERDMC(ActiveSet) 

16 : Let S be the link in EC with the lowest expected data rate 

among those links that, upon their elimination, reduce the 
maximum of the delay values in a row, for the largest number 
of rows (i.e., channels) [corresponding to links in ActiveSet] 
in the table of delay values [after suppressing the rows and 
columns corresponding to links that are not in ActiveSet] 
end procedure 


mathematically later in Appendices and M Let N be the 
number of links and T := {1, 2,..., iV} be the set of links in 
ActiveSet before calling the LC-ELDR policy. Further, let 


(step 30) Transmitter C terminates this run of the LC-ELDR 
policy. 

It is easy to convince oneself that all the transmitters running 
the LC-ELDR policy in the current time slot would arrive at 
the same decision - links other than would set their transmit 
decision to NoTransmit while link sets its transmit 
decision to Transmit. To illustrate this in short, transmitter 
D for example, when running the LC-ELDR policy in the 
current time slot, would make the same calculations as shown 
above till step 16 in round 2, whereupon transmitter D would 
execute step 18 (note that I — {I 4 } in this case) and thereby 
set the transmit decision for link I 4 (i.e., its own link) to 
NoTransmit, and terminate this run of the algorithm at 
transmitter D at step 19. 

So far, we have considered the case of networks with 
complete interference. Extension to the case of networks with 
multiple interference sets is as follows: (i) Let ActiveSet 
contain all the links in the network. Set M (j). Set 5ij ^ 
0 V/i ^ Ii- Vi, j where 5ij is the delay value in row i and 
column j of the delay table (ii) Run the LC-ELDR policy. Let 

I be the link chosen for transmission by the LC-ELDR policy. 
Set M ^ M U {/}. Set ActiveSet ^ ActiveSet \ ({Z} U /;). 
(iii) Repeat step (ii) while ActiveSet is non-empty. When 
ActiveSet becomes empty, the links in M are the set of links 
that will be allowed to carry transmission in the current time 
slot. 

The policy LC-ERDMC is a minor variation of 
LC-ELDR, where only step 16 of LC-ELDR is modified 
as noted in Algorithm In the example above, in step 16 
of round 1, the LC-ELDR policy chose to eliminate link 

II since it had a lower expected data rate compared to that of 
link I 4 , whereas the LC-ERDMC policy chooses to eliminate 
link I 4 since eliminating link I 4 reduces the delays with which 
the channel state of two channels (namely, I 2 and Z 3 ) can be 
accessed, whereas eliminating link li would reduce the delay 
of only one channel (namely, I 4 ). 


B. Throughput Near-Optimality 

In this section, we derive analytical expressions for the ex¬ 
pected saturated system throughputs of the LC-ELDR and 
LC-ERDAIC policies. We evaluate these expressions in 
Sec. VII for various network settings, and demonstrate that 
these expressions approximate the optimal throughput values 
very closely. We need a few definitions, which we introduce 
here rather informally, and make these definitions precise 


l{a} := 


1 if a is true 
0 otherwise 
■= max{l{ai}}. 


i&I 


We will call the “while” loop body from line 3 to line 
23, and also the computation in lines 24-29 in the algorithm 
listing of the LC-ELDR policy in Sec. VI a “round”. Thus, 


if the LC-ELDR policy terminates at line 26 or at line 28, 
then it would have executed N —1 rounds (specifically, N — 2 
rounds in the body of the “while” loop, and the last round 
[round — 1] in lines 24-29), and it would have executed 
r {r < N — 1) rounds if it terminates at line 14 or 19. Thus, 
the LC-ELDR policy can terminate after r rounds, 1 < r < 
N -1. 

Let (not to be confused with Ti{h) defined in Sec. 
II-DI be equal to ti. ^max (see Sec. II-DI at the beginning of 
round r, where Ti.^rnax is calculated after masking the rows 
and columns pertaining to the links that have been eliminated 
in rounds 1 to r — 1 (as illustrated in Figs. |^a), |^e), and 
l^h)). Let Cfc T- be the realization of channel-state on link l^. at 
time t — T. 

Let be the link that was (or that will be) eliminated 
(by the LC-ELDR policy) in round r (thus, = fc” [for 
k G T] implies that link k was (or will be) eliminated in round 
r, and = 0 ” implies that there is no candidate link to 
eliminate, and hence that the LC-ELDR policy terminates). 

Let i S r be the link chosen by the LC-ELDR policy 
in a particular time slot, say t, when the LC-ELDR policy 
terminates after executing r rounds, 1 < r < TV — 1. We 
now consider the working of the LC-ELDR policy when it 
is executed at the transmitter node of link i in time slot t. 
Consider a particular round f (1 < f < r) of the LC-ELDR 
policy. If link i has survived (i.e., was not eliminated in) the 
first f — 1 rounds, then it will survive round f if one of the 
following happens: 

(i) s(i,r, 1): link i has the largest expected data rate in 
round f (hence i = H and therefore i ^ EC', see steps 
6 and 7 of Algorithm [T]). We denote this condition as 
s(z,f,l). 

(ii) s{i, f, 2 ): link i does not have the largest expected data 
rate in round r (hence i 7 ^ H), and link i does not 
reduce the maximum of the delay values in a row, for 
any row (corresponding to a link in ActiveSet) in the table 
of delay values (after suppressing the rows and columns 
corresponding to links that have been eliminated up to 
round f) if it is eliminated in round r (hence i ^ EC', 
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see step 7 of Algorithm [^l, and some link (other than 
the one with the largest expected data rate in round r) 
reduces the maximum of the delay values in a row, for at 
least one row if it is eliminated in round f (i.e., EC ^ (p). 
We denote this condition as s{i,r,2). 

(iii) s{i, f, 3): link i does not have the largest expected data 
rate in round r (hence i ^ H), and link i reduces the 
maximum of the delay values in a row, for at least one 
row (corresponding to a link in ActiveSet) in the table 
of delay values (after suppressing the rows and columns 
corresponding to links that have been eliminated up to 
round f) if it is eliminated in round f (hence i G EC), and 
link i does not have the smallest expected data rate among 
the links that reduce the maximum of the delay values in 
a row, for at least one row if that link is eliminated in 
round r (hence i ^ S', see step 16 of Algorithm [T]). We 
denote this condition as 3). 

With the required definitions in place, we are now ready to 
state the following crucial result; 


Proposition 10. The expected saturated system throughput of 
the LC-ELDR policy is exactly 


N N-1 


EE E® 


r—l 


Cl 


3 

J&T 


r—l 


nG{l,2,3} 


X 7^0}^l{s(z,r, = O} 

= vr>- 

lCl.\t — = c. (,) I 

I ^ JgG{1.2.....r-l} 


peniri 

gG{l,2,. 


N / r-1 ' 

m—1 \ n—1 , 


n 


where 'Km{c) is the steady-state probability of being in state 
c in the channel-state Markov chain of link m, and ij is 
the transition probability of reaching state j from state i in 
k steps in the channel-state Markov chain of link m, with 
^( 0 ) _ 1 w„- ^ I20I 


= 1 Vi, j,TO 

We present mathematically precise definitions of the sym¬ 
bols that were defined informally in the paragraphs preceding 
Proposition 10 and the proof itself in Appendix]^ 

Proposition 11. The expression for the expected saturated 
system throughput of the LC-ERDMC policy is the same 
as the expression in Proposition except that the expression 
for eV) is redefined to be as noted in Appendix ^ 

The proof of this proposition is essentially the same as that 
for Proposition [To} the minor departures are noted in Appendix 
[M| We evaluate the expressions in Propositions and o 


= 7r(c) and Vm S £ in our model; see Sec. II-A 


numerically and plot them for various network settings in 
Sec. VII and demonstrate that these expressions approximate 
the optimal throughput values very closely and hence that the 
LC-ELDR and LC-ERDMC policies are near-throughput- 
optimal. 


C. Computational Complexity 
We have the following result: 


Proposition 12. The running times of both of the LC-ELDR 
and LC-ERDMC policies are 0{CL^ -\- L'^). 


We provide a proof of this result in Appendix [K| Thus from 
this proposition, for the case of a network with complete in¬ 
terference, the computational complexities of the LC-ELDR 
and LC-ERDMC policies are polynomial in the number 
links in the network and linear in the number of channel states. 
For a network with multiple interference sets, in the worst 
case (which happens when = 4>, VI G C), there are L 
calls to Algorithm [T] thus there is an additional multiplicative 
factor of L, which does not alter the (orders of) polynomial 
and linear dependencies on the number of links and the 
number of channel states respectively. Thus, the LC-ELDR 
and LC-ERDMC policies offer an enormous improvement 
over the computational complexities of the R and H policies. 


We compare the running times of these policies in Table VII 


VII. Numerical Results and Discussion 

In this section, we present numerical results comparing 
the saturated system throughputs of the R, H, LC-ELDR 
and LC-ERDMC policies. In addition, for purposes of 
comparison, we define two new policies that we call O and 
IC: 


Policy O: aig inax{Qi[t - Ti^rnax] x E[Ci[t] \Ci[t- '^l,7nax\ — 

Policy IC: argmax{(3i[f] x Ci[i\}i^c- 

The O policy chooses the link I that has the largest 
value of queue-length weighted expected data rate on the 
link, computed using the -delayed NSI of that link. 

Note that the O policy corresponds to stopping after round 1 
in the LC-ELDR and LC-ERDMC policies and declaring 
link H the “winner” (see step of Algorithm [TJ. The 
IC policy chooses the link I that has the largest value of 
queue-length weighted data rate on the link, computed using 
instantaneous NSI of that link, with the assumption that each 
link has access to the instantaneous NSI for all channels. 
We emphasize that the IC policy does not conform to the 
structure of delayed NSI noted in Sec. El but it serves 
as an upper bound on the system throughput that can be 
achieved in the delayed NSI regime. 


A. Methodology 

As in Ul, we consider networks with complete interference 
(i.e, Ii = C \ {Z}, VZ G C) and perfect collision (i.e., 7 ; = 
0, VZ G C). The channel state on each link is modeled as 
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TABLE IV: The Very Small Delays (VSD) delay profile for a wireless 
network with three links 



At TX A 

At TX B 

AtTX C 

Delay in obtaining NSI of link li 

0 

1 

1 

Delay in obtaining NSI of link I2 

1 

0 

1 

Delay in obtaining NSI of link I3 

1 

2 

0 
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Fig. 3 : Expected saturated system throughputs (in data units transmitted) per 
time slot of the various policies for a network with three links, VSD delay 
profile and different channel profiles. The curves of all but the IC policy 
overlap. 


a DTMC on the state space C — {1,2}. We only consider 
single-hop transmissions in the network. We implemented all 
the policies in a custom C-H- simulator. All the numbers from 
simulation that we quote in this section were averaged over 
10 ^ trials. 


B. Results and Discussion - Throughput 

Fig. shows the expected saturated system throughputs of 
the various policies for a network with three links, for the very 
small delay (VSD) delay profile (see Table IV i, for different 
channel profiles. For this purpose, we define five channel 
profiles - namely, very slow varying channel (VSVC), slow 
varying channel (SVC), medium varying channel (MWC), fast 
varying channel (FVC) and very fast varying channel (VFVC), 
with channel-state crossover probabilities of 0.1, 0.3,0.5, 0.7 
and 0.9 respectively. We observe that all the policies perform 
equally well when the delay values are very small (the plots 
for all but the IC policy overlap in Fig. [^. 

Fig. 0 shows the expected saturated system throughputs 
of the LC-ELDR, LC-ERDMC, O and IC polices for 
a network with three links and VSVC channel profile, for 
different delay profiles. For this purpose, in addition to the 
VSD delay profile noted in Table |IV[ we define four other 
delay profiles - namely, small delays (SD), medium delays 
(MD), large delays (LD), and very large delays (VLD) as 
shown below; 
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delay profiles 

Fig. 4 : Expected saturated system throughputs (in data units transmitted) 
per time slot of LC-ELDR, LC-ERDMC, O and IC policies for a 
network with three links, VSVC channel profile and different delay profiles. 
The analysis and simulation curves of each policy overlap. Throughput of 
LC-ERDMC is marginally below and marginally above that of LC-ELDR 
for the MD and LD delay profiles respectively. 


TABLE V: Table of heterogeneous delay values for the Very Small 
Delays (VSD) delay profile for a network with ten links 
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Similar to the entries in Table IV the entry at position ij 


in each of the delay profiles is to be interpreted as the least 
delay with which the NSI of link li is available at transmitter 
j (i.e., at the transmitter node of link Ij). We do not show the 
system throughputs of the R and H polices in Fig. since 
the delay values for all but the VSD profile are such that 
it is computationally hard to evaluate the saturated system 
throughputs of these policies. We note that the LC-ELDR 
and LC-ERDMC policies perform almost equally well, and 
both of them outperform the O policy. Fig. also depicts the 
degradation of system throughput with increasing delay values. 

Fig. 0 shows the expected saturated system throughputs of 
the R, H, LC-ELDR, LC-ERDMC, O and IC polices 
for the case of VSVC channel profile and VSD delay profile, 
for different number of links in the network. The VSD delay 
profile, noted in Table [V] is for a network with ten links. The 
VSD delay profile for a network with n links, 2 < n < 10, is 
derived from this table by taking the nxn sub-matrix hinged at 
the top-left corner of this table. We show analysis plots of the 
LC-ELDR and LC-ERDMC policies only up to #links = 5 
since computing these values for networks with larger number 
of links is computationally costly. We note that the throughputs 
of both the LC-ELDR and LC-ERDMC policies overlap 
with the optimal throughput values of the R and H policies. 
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Fig. 5: Expected saturated system throughputs (in data units transmitted) per 
time slot of the R, H, LC-ELDR, LC-ERDMC, O and IC policies 
for a network with VSVC channel profile and VSD delay profile, for 
different number of links in the network (analysis curves of LC-ELDR and 
LC-ERDMC are shown only up to #links = 5). The analysis and simulation 
curves of each policy overlap. 


TABLE VI: Table of heterogeneous delay values for the Medium 
Delays (MD) delay profile for a network with ten links 
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Fig. 6: Expected saturated system throughputs (in data units transmitted) per 
time slot of the LC-ELDR, LC-ERDMC, O and IC policies for a net¬ 
work with VSVC channel profile and MD delay profile, for different number 
of links in the network (analysis curves of LC-ELDR and LC-ERDMC 
are shown only up to #links = 5). The analysis and simulation curves of each 
policy overlap. 



delay profiles 

Fig. 7: Expected saturated system throughputs (in data units transmitted) per 
time slot of the R, H, LC-ELDR, LC-ERDMC policies for networks 
with four links, VSVC channel profile and different delay profiles (DP). 

Fig. |6] shows the expected saturated system throughputs 
of the LC-ELDR, LC-ERDMC, O and IC polices for 
the case of VSVC channel profile and MD delay profile, for 
different number of links in the network. The MD delay profile 
for a network with ten links is shown in Table |y|] As before, 
the MD delay profile for a network with n links, 2 < n < 10, 
is derived from this table by taking the n x n sub-matrix 
hinged at the top-left corner of this table. We do not show 
the system throughputs of the R and H polices in Fig. 
since the MD delay profile is such that it is computationally 
hard to evaluate the saturated system throughputs of these 
policies. Again, as before, we show analysis plots of the 
LC-ELDR and LC-ERDMC policies only up to #links = 5 
since computing these values for networks with more links is 
computationally costly. We note that both of the LC-ELDR 
and LC-ERDMC policies outperform the O policy. We also 
note, more importantly, that there are situations (#links = 3, 
4) where LC-ELDR outperforms LC-ERDMC and other 
situations (#links = 6 to 10) where LC-ERDMC outperforms 
LC-ELDR, implying that none of these policies could be 
throughput optimal 

Fig.lZl shows the expected saturated system throughputs 
of the R, H, LC-ELDR and LC-ERDMC polices for 
networks with four links, VSVC channel profile and different 
delay profiles (DP for short; see below). We first comment 
that the high computational cost of the R and H policies 
hinder us from measuring their performances for very many 
delay profiles, and for all the delay profiles that we could 
measure, the expected saturated system throughputs of the 
LC-ELDR and LC-ERDMC policies were less than 1.1% 
(and in many cases less than 0.5%) away from the optimal 

^^With reference to the low-complexity scheduling policies in Sec. |Vl| in 
the process of obtaining a link that would maximize the expected system 
throughput, in each iteration, the LC-ELDR policy discards the link with 
the lowest expected data rate, and the LC-ERDMC policy discards the 
link that, upon its elimination, reduces the delay values for the largest 
number of channels (see step |I6| of the algorithms). Even though taking 
recourse to eliminating one link in each iteration gives the LC-ELDR and 
LC-ERDMC policies their short running times, the approaches that these 
policies take to achieve this, as alluded to in Fig.[^ are not optimal. It remains 
unresolved as to whether there is an optimal strategy to isolate a link that 
would be eliminated in each iteration, and if one exists, what its structure 
should be. This needs further exploration. 
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TABLE VII: Comparison of the running times of the R, LC-ELDR 
and LC-ERDMC policies. All times are in microseconds. 


# Links 

R Policy^^ 

LC-ELDR 

Policy^ 

LC-ERDMC 

Policy^ 

5 

4 X 10^ 

11 

10 

10 

1.1 X 10*^ 

102 

96 

15 

3.9 X 10® 

165 

151 

20 

8.14 X lO’’ 

247 

237 


t As reported in 

§ Information about the delay profile used in fU is not available. We are 
reasonably sure that the delay profile used in (l) uses smaller delay values 
than those in the MD delay profile that we have used here for the LC-ELDR 
and LC-ERDMC policies, since evaluating the R policy for the MD delay 
profile is prohibitively time consuming. Thus the running time values that we 
mention here for the LC-ELDR and LC-ERDMC policies are relatively 
pessimistic, and provide a more-than-fair comparison. 

I As measured on a computer with Intel Core 2 Duo processor clocked at 2 
GHz, with 2 GB RAM. 

I For the MD delay profile whose delay values for Minks = 5, 10 are shown 
in Table [vT| The delay values in the MD delay profile for Minks = 15, 20 are 
similar to those shown in Table [vT] 


throughput values of the R and H policies, thus demonstrating 
the near-optimality of throughputs of the LC-ELDR and 
LC-ERDMC policies. The following are the definitions of 
the six delay profiles referred to in Fig. (the entry at position 
ij has the same interpretation as before): 
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In Table VII we compare the running times of the R, 


LC-ELDR and LC-ERDMC policies for a network with 
varying number of links, VS VC channel profile, and MD 
delay profile. Evidently, in stark contrast to the R policy, the 
LC-ELDR and LC-ERDMC policies are computationally 
very efficient, validating the computational complexity analy¬ 
ses in Propositions and \12\ 


C. Results and Discussion - Queueing Delay 

In this section, we evaluate the queueing delay performances 
of the DQICl, DQIC2, R and H policies numerically and 
compare these with the queueing delay values obtained by 
evaluating the analytical expression in Eq. for these 

policies. In Sec. |V-A[ we saw that the computational cost of 
comprehensively evaluating Expr. ( [T6| ) is prohibitively expen¬ 
sive even for a small network that has only 2 transmitters, 
when there are only two channel states for any link, and when 
the number of packet arrivals in any slot is only 0 or 1, for 
a small number of time slots T = 10. While reducing T to 
smaller values (say T = 5) will allow for a comprehensive 
evaluation of Expr. ( [Th] ), small values of T do not allow us to 
capture the dynamics of packet arrivals (and hence of queue 
lengths) and of channel state transitions precisely enough. In 
fact, the number of time slots (T) over which we evaluate Expr. 


should be at least of the order of a few hundred to be 
able to capture the queue and channel transition dynamics with 
reasonable accuracy. Therefore, we resort to evaluating Expr. 
( [T6| ) not comprehensively but for a few arrival streams and for 
a few channel state sample paths. We do this in two ways - 
(i) by selecting these sample arrival streams and channel state 
sample paths randomly, and (ii) by selecting typical (i.e., high 
probability) sample arrival streams and typical channel state 
sample paths. This gives us leeway in evaluating Expr. 
for larger values of T. 



T delay values (x) 

max ^ ' ' 


(a) 



T delay values (x) 

(b) 

Fig. 8: Comparison of analytical and simulation results for the average queue¬ 
ing delay per-packet in the DQICl and DQIC2 policies. The DQICI 
policy uses -delayed QSI whereas the DQIC2 policy uses r^rnax- 

delayed QSI. The analytical queuing delay values are obtained by using 100 
randomly chosen arrival streams and channel state sample paths in (a) and 
using 100 typical (i.e., high probability) anival streams and channel state 
sample paths in (b). The match between analytical and simulation results is 
substantially more precise when using typical sequences for arrival streams 
and channel transitions. 


Eirst, as in Sec. we consider a network with two 

transmitters (links), with the channels on all links modeled 
as independent Markov chains on the state space {1,2} 
with channel state transition probability 0.1. We consider 
heterogeneous delays as noted in Table where a; > 1 is 
a parameter we vary. Packets arrive into the two queues at 
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the two transmitters as independent Poisson processes with 
rates Ai = A 2 = | (for this setting, it is easily seen that 
sum rates (i.e. Ai + A 2 ) of up to 1.75 are supportable; see 
footnote [m. In Fig. we compare the average queueing 
delay performances of the DQICl and DQIC2 policies. For 
the analytical plots we take the time horizon T = 2000 and 
for the simulation plots we average the delay values over 10 ^ 
trials. For reasons mentioned in the previous paragraph, for 
the analytical plot in Fig. [^a), at each of the two links in the 
network, we consider a 100 random arrival streams and a 100 
random channel state sample paths, and for the analytical plot 
in Fig.j^b), we consider a 100 typical arrival streams and 100 
typical channel state sample paths (this gives a total of 100 ^ 
or 10 ® different arrival stream - channel sample path pairs). 
We observe that the match between analytical and simulation 
results is very precise when using typical sequences for arrival 
streams and channel transitions. 

Next, in Fig. |^a), we compare the average queueing delay 
performances of the R and H policies, and in Fig. [^b), that 
of R, LC-ELDR and LC-ERDMC policiesp^Because of 
the prohibitively large computational complexity involved in 
evaluating the R and E[ policies, we consider only 10 typical 
arrival streams and 10 typical channel state sample paths (for 
a total of 10 "^ different arrival stream - channel sample path 
pairs). Again, the match between analytical and simulation 
results is precise validating the gains in delay performance 
that can be obtained by using r -delayed QSI. 

Finally, in Fig. |^c), concentrating on the delayed queue- 
lengths at link Zi, we compare the correlation coefficients 
between the queue-lengths at Qi^ [f] (i.e., the instantaneous 
queue-length) and Qi^ \t — x\ for a; = 1 ,..., 10 in the DQICl 
and DQIC2 policies. Since in our example, for the case 
Tmax = 2, the DQIC2 policy uses the queue-length at time 
t — 1 and the DQICl policy uses the queue-length at time 
t—2, we have highlighted these values with small filled circles. 
As we would expect, we see that the queue-length at time t—1 
(i.e. at time t — used by the DQIC2 policy is more 

positively correlated with the instantaneous queue-length than 
is the queue-length at time t — x for a; = 2,..., 10 used by the 
DQICl policy, thus explaining the delay performance gains 
of the DQIC2 policy over that of the DQICl policy. 

VIII. Conclusion 

In this work, addressing the problem of distributed schedul¬ 
ing in wireless networks with heterogeneously delayed NSI, 
we proposed, analyzed and evaluated the performances of 
two fast and near-throughput-optimal scheduling policies. En- 
route, we identified and dealt with two deficiencies (namely, 
non-optimal delay performance and high computational com¬ 
plexity) in an earlier work in |[T]. We showed that the in¬ 
formation afforded by the system model could be exploited 
more aggressively, and that by doing so, immense reduction 
in computational complexity and substantial reduction in the 
expected per-packet queueing delay could be obtained. We 
proposed a provably throughput-optimal scheduling policy that 

^^The LC-ELDR and LC-ELDR are identical for the case of networks 
with only two links. 


embodies these ideas. The proposed fast and near-throughput- 
optimal scheduling policies champion these ideas further, pos¬ 
sess desirable queueing delay characteristics, and have running 
times that are of the order of microseconds, sufficiently small 
for use in practical deployments. 
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Appendix A 

Proof of Proposition[T] 

Since Tiy.) is a function of : Tmax)) and 

since {C/Jf — Tmax]}ii^c is made available as part of the 
computation of the conditional expectation (the given) in Eq. 
(|^, the number of parameters to the threshold function Tj. (.) 
is the number of elements in the set Vi := (7S';^(C[f](0 : 
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(a) (b) (c) 

Fig. 9: (a) Comparison of average queueing delay per-packet in the R and H policies. The R policy uses Tmax-delayed QSI whereas the H policy uses 
T,Tnaa:"delayed QSI. The average per-packet queueing delays of the R policy grows linearly with Tmax whereas that of the H policy is sub-linear tending 
to become almost unaffected by increase in Tmax- Comparing the R and H policies and their delay performances, the fact that the R policy uses Tmax- 
delayed queue lengths isolates itself as the sole reason for its undesirable delay performance, (b) Comparison of average queueing delay per-packet in the 
R and LC-ELDRJLC-ERDMC policies. The LC-ELDR and LC-ERDMC policies, like the H policy, use r ^y^a^j-delayed QSI. (c) Comparison of 
correlation coefficients of queue length values in Qi^ [f] and Qi^ [t — x] in the DQICl and DQIC2 policies. 


Each of the random variables in CSl^{.) can take any of 
the C values where C is the number of channel states. Thus, 
for Ti.{.), there are different values in the domain of 
Each of these in the domain of Tj.(.) can be 

independently mapped to a real number. In Proposition [T^ we 
show that it suffices to consider C + 1 carefully chosen real 
numbers in the range of T;. (.). Eor example, from Proposition 


13 for C = {1,2}, it suffices to consider the real numbers 
0.5,1.5, 2.5 as the set of possible values in the range of T/. (.) 
that each element in the domain of Tj. (.) can be mapped 
to. Thus, there are (C + possible choices of threshold 

functions for Ti -(.). Therefore, considering all the links in the 
network, the total number of threshold functions that need to 
be considered in the domain of optimization in Expression ([Hll 

V' I Vi I 

is (C + l)^'.e^^ . □ 

Proposition 13. It is sufficient to consider (C+1) real numbers 
in the range of each threshold function Ti{.). 

Proof: Consider the interval S = {ci, Q+i) on the real number 
line for any integer i G {0,1,...,Cj, ci,C 2 ,...,cm S C 
(where cm = cg) are the channel states in the channel state 
DTMC (see Sec. II-A l and cq = ci — Ip^ Consider a fixed 
real number ri G S, and a set T C C. Evidently, ri shares the 
same relationship (i.e., <, <, >, >) with each element of T as 
all other real numbers in S. Hence, it is sufficient to pick ri as 
a representative of all real numbers in S. The result follows 
noting that there are C + 1 intervals S for each of which a 
representative real number ti has to be chosen. □ 


Appendix B 

Proof of Proposition|2] 

Ti{.) is a function of CS^{.), and hence the number 
of choices of threshold functions for Ti{.) is a function of 

use C to denote both the set and its cardinality. Thus, as set, C = 
{ci, C2 ,..., cm}^ and as cardinality, C = M. 


the number of random variables in which in turn 

is dependent on CS^{.) through Expr. Q. Evidently, the 
number of random variables in CS^{.) is maximized when 
the delays in the table of delay values are all distinct. 

The number of random variables in the set C'S'/^(.) 
pertaining to each link Ik (i.e., entries of the form [t — r] 
in the set CSl^{.)) is maximized when Ti^{lk) is the smallest 
of all the delay values in row k. This is because, the 
transmitter node of link li, having the CSI of link Ik with 
the delay n^ilk) (which is the smallest delay in row k), also 
has the CSI of link Ik with delays larger than Ti^{lk), and 
hence the random variables {Cijf - 

are all present in the set CS[^{.). Similarly, after fixing 
the delay value at position {k, 1) to be the smallest value 
in row k as noted above, the number of random variables 
in the set CS^{.) pertaining to link Ik (i.e., entries of the 
form Ci^.[t — r] in the set CS';^(.)) is maximized when 
Ti^{lk) is the second-smallest of all the delay values in row 
k (second-smallest since, Ti^{lk) being the smallest in row k, 
it can’t be that Ti^{lk) < 

Eollowing the structure noted above, we rearrange all the 
delay values in each row i in ascending order (except the 
delay value of 0 at position {i,i) which is left as is). We 
note that this structure on the delays in the table of delay 
values characterizes a worst-case scenario for the R policy. 
Einally, the structure on the table of delay values as needed 
in the statement of the proposition is obtained by swapping 
columns j and L — j + 1 for j = 1,..., L^/2J, which is 
legitimate since the transmitters are symmetric for purposes 
of calculating functional evaluation complexity. 


Appendix C 

Proof of Proposition!!] 

T/(.) is a function of CS^f). Given the structure of worst- 
case delay values noted in Proposition we see that for 
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transmitter 1 , the set CSP^{.) has a number of random variables 
that is the sum of the values in the L-tuple (L— 1, 1 , 1 ,, 1 ), 
since transmitter 1 has NSI of link li with zero delay and hence 
possesses NSI of link li with delays equal to all other L — 1 
values in row 1 of the delay table, and since transmitter 1 has 
NSI of link h {2 < i < L) with a delay that is the maximum 
for link li across all transmitters. Similarly, for transmitter 
2 , the set CSl^{.) has a number of random variables that is 
the sum of the values in the L-tuple {1,L — 1,2,2, , 2), 
since transmitter 2 has NSI of link li with a delay that is 
the maximum for link li across all transmitters, and since 
transmitter 2 has NSI of link I 2 with zero delay and hence 
possesses NSI of link I 2 with delays equal to all other L — 1 
values in row 2 of the delay table, and since transmitter 2 
has NSI of link k (^ < i < L) with a delay that is second- 
maximum for link k across all transmitters. Generalizing, we 
see that for transmitter i, the set C'S'/^(.) has a number of 
random variables that is the sum of the values in the L- 
tuple {i — l,i — 1, ... ,i — 1,L — l,i,i,... ,i) (where the 
term L — 1 is at position i in the L-tuple). Also, given 
{Ci[t — Tmax]}i^c^ the number of random variables in each 
of the sets CS[^{.), 1 < i < L, reduce by 1. Therefore, 
the number of parameters for T;j^(.), and Tj.(.) are 

respectively ((L— 1 )-|-(L— 1 ) — 1 ), ( 1 -|-(L— 1 )-|- 2 (L — 2 ) — 1 ), 
and {{i — 1)^ -f (L — 1) -f i{L — i) — 1) which simplifies to 
{i{L — 2) + {L — 1)). In the language of Propositionthese 
are the values of |Vi|, IV 2 I and |Vi| respectively. The required 
result follows immediately on appealing to Proposition □ 

Appendix D 

Proof of Proposition^ 

The conditional expectation in Expr. •El is evaluated given 
C[f — Tmax]- Therefore, there are Tmax random variables in 
the sample path for each link I (namely, the random variables 
Ci[t — {Tmax — 1)], Ci[t]). Hence, the total number 

of random variables in the sample path is Lxmax- Each of 
these random variables can take C values. Consequently, the 
number of sample paths required in evaluating the conditional 
expectation in Expr. (j^ is 


Appendix E 

Proof of Proposition|5] 

The arguments are similar to those in the proof of Propo¬ 
sition [T] We present it nevertheless for purposes of complete¬ 
ness. 

Since T;.(.) is a function of C'S';.(C[f](0 : T max)) and 
since {Ci.[t — Ti.^max]}ii&c is made available as part of the 
computation of the conditional expectation (the given) in Eq. 
0 , the number of parameters to the threshold function T/. (.) 
is the number of elements in the set Wi := CS'i;(C[t ](0 : 

))\{CiAt-n i ,max ]}kG£- 

Each of the random variables in CSiA ) can take any of 
the C values where C is the number of channel states. Thus, 
for TiA-)^ there are different values in the domain of 

TiA-)- Each of these in the domain of T;. (.) can be 

independently mapped to a real number. Erom Proposition [T^ 


it suffices to consider C + 1 carefully chosen real numbers 


in the range of TiA-)- For example, from Proposition 13 


for C = {1,2}, it suffices to consider the real numbers 
0.5,1.5, 2.5 as the set of possible values in the range of TiA-) 
that each element in the domain of TiA-) can be mapped to. 
Thus, there are (C -f 1)“'''^’' possible choices of threshold 
functions for TiA-)- Therefore, considering all the links in 
the network, the total number of threshold functions that need 
to be considered in the domain of optimization in Expression 

" ,w., ^ 


is (C -f 1 )^' 


Appendix E 

Proof of Proposition^ 

By definition (see Expr. (§ and @), C'5'^(C[f](0 : 
Tmax)) = C'5'(C[f](0 : T^max))- The pi'oof is complete 
by considering the Expr. Q and 0 , and noting that 
C'S'^(C[f](0 : Tmax)) and C'5'(C[f](0 : T^max)) do not 
have CSI of link m with delay values greater than Tm,max, 
for any m G C, leaving the CSI of link m with delays 
greater than Tm.max in T’;^(C[f ](0 : Tmax)) (and hence in 
T’f (C[f](0 : Tmax))) redundant. 


Appendix G 

Proof of Proposition!?] 

We first note from Proposition that, the number of 
random variables in the sets CSA{.) and CSi{.) are the 
same. The rest of the arguments are similar to those in proof 
of Proposition and therefore omitted. 


Appendix H 

Proof of Proposition|8] 

Ti{.) is a function of CSi{.). Eollowing similar arguments 
as in the proof of Proposition in Appendix we note 
that, for transmitter i, the number of random variables 
in the set CS”/.(.) is the sum of the values in the L- 
tuple (z — l,i — 1,.. . ,i — 1,L — l,i,i,... ,i) (where the 
term L — 1 is at position i in the L-tuple). Also, given 
{Ci[t — Ti^max]}i£C 7 the number of random variables in each 
of the sets CSi.{-)^ 1 < * < T, reduce by L. Therefore, the 
number of parameters for the zth threshold function TiA-) is 
{{i — l)'^ + {L—l)+i{L — i) — L) which simplifies to i{L — 2). 
Thus, in the language of Proposition \Wi\ = i{L — 2). 
Eollowing similar arguments as in the proof of Proposition 
in Appendix the total number of threshold functions that 
are needed to be considered in the domain of optimization in 


Expr. (10 1 is {C + 1) 






Appendix I 
Proof of Lemma[T] 

The proof of this lemma is similar to the proof of 
Lemma 4.1 in Q. We are given that {A[f]}t is supportable. 
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This implies that there exists a policy T that has the 
following properties; (i) T uses ki time units delayed NSl 
of link li in making its scheduling decisions at time t 
(where n^rnax < ki < t), (ii) T stabilizes the Markov 
chain Z^[t] = {Q[f](0 : Tk^max),C[t]{0 : Tk,max)}, 

wher^Q[f ](0 : Tk,max) ■= {Qii[t]{0 ■ Tk.^max)}iiGC, and 
C[f](0 : Tk^max) ■■= {^,^(0 : Tki,max)}h(^c- We note that 
T could possibly use queue-state information in making its 
scheduling decisions. 

First, from T, we construct a new policy J^', on the 
system state Markov chain Y[t\ = {(5z[t](0 : Tl^max)^C\t^^{Q : 
Ti^max)}iGC as noted nextp^ Consider z G Z^[t] where z = 
{tl(r',,maa: “t” 1 • ^ Y 1 ; A;)c(0 ! T^^rnax)^•> 

where qir^max + 1 : k) := {qi^{Ti.^^ax + 1 : A*)}ziG£. 
q (0 

• '^.,max^ ■ — {9z(0 : n ,max )}ig£, and c(r 

,max + l : k) 

and c(0 : r^max) are similarly defined. We “collapse” 
Z^[t] onto [f] as follows. Let I be the link that policy 
T schedules in state z. Then z is collapsed onto state 
y e Y^'[t] where y = {q(0 : T^m.ax),c{0 : r^max)}, with 
F' scheduling link I in state y with probability 
(=: 7r(y,l)) where z' are all the states in Z^[t] that 
“collapse” onto state y (i.e. states in Z^[t] that match the 
NSI {q(0 : T^rnax),c{0 ■ T.max)}) such that F schedules link 
I in state z', and where 7 f(z') is the stationary probability of 
being in state z' under the policy F. Let 7 r(y) = 0- 

In Lemma we show that the probability with which link I 
is scheduled in the policies F and F' is the same. 

For y = (q,c), we let r{y) = Pr(q|c) where 

Pr(q,c) = 7 r(?/). From F\ we construct a new policy 
Fs, whose scheduling decisions are independent of all 
queue-state information, on the system-state Markov chain 
Y[t] = {Q[f](0 : T^max),C[t]{0 : T^max)} as follows - at 
each time, when the delayed CSI C[f](0 : T.^max) = c, the 
policy Fs probabilistically schedules link I with probability 
I]y=(q,c) ^(y> 0 /^( 2 /) X ’I'iy)- In Lemma we show that 
the probability with which link I is schedulea in the policies 
F' and Fs is the same. This would then imply that Fs 
supports yl[f]. Since Fs makes its scheduling decisions based 
on the delayed CSI C[f](0 : r^max) = c (oblivious of queue 
state information), and since the service rates of all policies 
using the delayed CSI C[f](0 : r mas) = c are considered in 
defining A, we have G A. 

■ 

Lemma 2. The policies F and F' schedule a link I with the 
same probability. 

Proof: Policy F schedules link I with probability 'k(z), 
where z are all the states in Z^[t] such that F schedules 
link I in state z. Policy F' schedules link I with probability 
where z' are all the states in Z^[t] that 
“collapse” into state y G Y^ [f] such that F schedules link I in 
state z'. This quantity is equal to 0 (alternatively, in 

state y G Y[t], the policy F' schedules link I with probability 
TT{y,l)/TT{y), and hence, overall, policy F' schedules link I 
with probability Ey ^( 2 /> 0 /^( 2 /) x = Y.y^ivJ)')- The 

write Z^[t] as Z^[t] to improve legibility. 

^^we write Y[t] and Y'X [t] as Y[t] and [t] respectively, to improve 
legibility. 


proof is complete by noting that J^z '^(■^) = Sy ^(j/j 0 since 
states z include all states in Z^[t] such that F schedules link 
I in state z, not just those that “collapse” into a particular 
state y G Y^ [f]. 


Lemma 3. The policies F' and Fs schedule a link I with the 
same probability. 

Proof: From the proof of Lemma we see that the policy F' 
schedules link I with probability Given that the 

delayed CSI C[f](0 : r^max) = c, the policy Fs schedules 
link I with probability: 


E ^ X '■(!') 


y=(q.c) 


Ky) 

E TT(y. /) / \ , / \ 

X r{y), where y = (q,c) 

. Y X Pt(,ic) = Y ^ X 

‘ ^ TTI in 




Ay) 

^{yA) „ ^( 2 /) 


7 r(i/) Pr(c) 


= E 


%{y) Pr(c) 

^{yA) 

Pr(c) 


Now, averaging over all c, we see that the policy Fs schedules 
link I with probability 


'kjy, 2 ) 
Pr(c) 


X Pr(c) = '^Tt{yA) 
(q.r) 


E^(2/,0- 

V 


Appendix J 
Proof of Theorem[T] 
Consider the optimization formulation 


arg maxy^ - 

^l,max ]I?/,r„„.(F(.)), (17) 

T^c 


Ri,r (F(.)) := E[Ci [t]FA.)iyi + (1 - 7 ,) 

UmelM-Prn{.)))\C[t-T]], 

C[t 

'^.,max^ ■ — {Ci[t - n 

,max ]}ie£, and 

Fi : rAC[t]{0 : T^max)) ^ {0,1} V2 G C, 


with the semantics 
of each link I is 
F;iVAC[t]iO : T 
optimizing solution 


that, at time t, the transmitter node 
allowed to transmit if and only if 
,max))) = 1- From Lemma H the 

(namely, F*(.) = {P)*(^iG£) 


satisfies the threshold property F^ {Vi{C[t]{0 : T^max))) = 


4C'i[t]>T*('P<(C[t](0:T,, 


^)))}, from Lemma |5| the optimizing 
solution depends only on the critical set NST for each link I, 
i.e. Ti*iViiC[t]iO : r^max))) = r;(C5;(C[t](0 : r,„„,))), 
and from Lemma the F[ policy supports any mean arrival 
rate vector A that satisfies (1 -f e)A G A for any e > 0, 
but from Lemma [T] A is the region that encompasses all 


supportable arrival rates given the NSI structure in Sec. II-D 


whence it follows that the FI policy is throughput optimal. 
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Lemma 4. F;{Vi{C[t]{0 

• '^.,‘max))') — 

l{C,[t]>T*CP,(C[t](0:r,„„J))} 

Proof: The proof of this lemma is similar to the proof of 
Lemma 4.2 (part one) in Q, and is therefore omitted. 


~ + 1] ~ Qi[t]){Qi[t + 1] + Qi[i\) I 

Q[f- C[t- 

'^.,max\ ]) 

where Q[f - r^^ax] = {Qi[t - Ti^max]}iGC and C[t - r^max] 
= {Cl[t - Tl 

,max 


Lemma 5. T*{Vi{C[t]{Q : t„))) = T*{CSi[C[t]{Q : 

,max)}} 

Proof: Even though a proof similar to the proof of Lemma 
4.2 (part two) in 13 applies, we give an alternate more 
intuitive proof here. First, consider the NSI of link li at 
time t - T, where t < n^^min, 

Since no transmitter has NSI of link li at time t — t, none 
of the transmitters can make their thresholding decision 
based on C/Jf — r], and therefore, C/Jf — r] is absent 
from CS”;; (C[f](0 : T.^max)) for all k G L. Next, consider 
two links li,lj,i,j ^ 1 such that + 1 and 

is the next larger delay value after r/.(^i) in the row 
corresponding to link li in the table of delay values. Consider 
a delay value r such that r;. ((i) < r < i.e. t does 

not appear in the row corresponding to link li in the table 
of delay values, or in other words, no transmitter has the 
NSI of link li with delay r. Now, transmitter node of link 
li does not need to make its thresholding decision based on 
r since it has NSI of link li with a delay smaller than t 
(since < r) and the channel is Markovian. Therefore, 

els';. (C[f](0 : T^rnax)) need not contain CjJf — r]. Lastly, 
consider two links ^ 1 as above, and a delay value 

T such that Ti-ili) < T = Ti^{li). While it may appear that 
CSi^{.) need not contain C/Jf — r] since < r and 

the channel is Markovian, Ci^ \t — r] is nonetheless required 
to be in CSi.{.) since transmitter node of link Ij makes 
its thresholding decision based on r, and Ci^ \t — r] helps 
in arbitrating when comparing the thresholding functions 
Ti^{.) and From the above arguments it follows that 

the optimal solution does not depend on any channel state 
information that is not critical NSI. 


Lemma 6. The H policy supports any mean arrival rate 
vector X that satisfies (1 + e)A G X for any e > 0. 

Proof: We first define a Lyapunov function of the queue 
lengths available in the system state as follows: 

V[t]:=V[Y^[t]]-.= J2Qnt] (18) 

lec 

For a given arrival rate vector A that satisfies (1 + e)A G A 
such that e > 0 , we show that the expected change, from 
one time slot to the next, in the sum of the squares of the 
queue lengths at all the links in the network, is negative for 
all but a finite number of states. This implies, from Foster’s 
theorem (see |fl 8 ]|), that the system state Markov chain 
is positive recurrent, establishing that the network remains 
stochastically stable for this arrival rate vector. 

From Equation ( fTS] ), we have 

E[ V[t + 1] - V[t] I Q[f - T^max], C[t - T.,max] ] 


Since in each time slot, the arrivals into a queue and departures 
out of this queue are both bounded, we have 

+1] ~ Qi[t]){Qi[t +1] + Qi[t]) I 

Q[t- C[t- 

'^.^max] ] 

< Ki +E[X;ie£(Qi[f+l] - Ql[t])i‘2Qi[t - Tl^max]) I 

Cl[t-T^max],C[t-T^rnax]] (19) 


Next, using the queue update equation we show that 

E[ + 1] - - Ti^rnax]) I 

Q[t- C[t- 

'^.^max] ] 

< E[ EleciRl,r..r..AT*mQl[t - Tl,r^ax]) \ 

T^max]^ C [f 'C.^max] ] 

where T* is the optimal T resulting from the optimization 
formulation in the scheduling policy. 

Using the queue update equation (Eq. Q), we have 

Qi[t + 1] — Qi[i\ < Ai[t] + 

Multiplying both sides of this inequality by 2Qi[t — t/ j„ax] 
and taking conditional expectation with respect to Q[f—r max] 
and C[f — T^rnax], we have 

E[(Qi[f+l] - Ql[t]){2Qi[t - Ti^rnax]) \ Qit - T^max], 

C[t T^,max] ] 

E[ ^max 2 Qi\t ’Tl^rnax] \ T,max]; C[f ‘^.,max] ] 

+ E[Sl[t]2Qi[t - Ti^^ax] I Q[t T,max];C[^ T,max] ] 

( 20 ) 

We note that the first term on the RHS is a constant, since 
the value of the term Qi[t - n^rnax] is given in Q[f - r^max]- 
We let ki represent this constant, and also note that for the 
H policy, the factor Si[t] in the second term is the same as 
Ri t „„^(T*). Therefore, we have 


E[((5i[f + 1] - Qi[t]){2Qi[t - Ti^ 
< fc; + E[ Ri^t 


]) I Q[t-T.x 
C[t- r.,, 

(T*) 2 Qi[t — Ti^max] I 

Q [f T,max]; r", max] ] 


( 21 ) 


Summing LHS and RHS of this inequality over all I, and 
noting that expectation is a linear operator, we have 

+ ^] - Ql[t]){‘^Ql[t - n,niax]) 1 Q[t 

C [t '^,,max] ] 

<K2+ 2Qi[t - r^.max] ] 

Q[^ T,,maa:]; C[t ‘^.,max] ] (^2) 
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From Inequalities ( [T^ and ( |22| ), we have 

'^i^c^QA^ + 1] ~ Qi[A){QA^ + 1] + QAA) I 

< ^3 + E[ E;e£^i,r (T*) 2Qi [t - Tl,max] \ 

Q[t-T.^max]:C[t-T.^rnax]] (23) 


That is, 

E [ + 1] - V[t] I Q[i - T^max], C[t - T^max] ] 

<K3+ nElecRl,r.,^.JT^*) ^QAt - n,max] \ 

Q[^ '^.^max]i C[t 'T.^max] ] (24) 

Since (1 + e)A S A, there exists {j 7 (c)}c such that 

?7 := ^ 7 r(c)^(c) G A, and (25) 

cSC^ 

^ TT (c) ((1 + e)Ai - f); (c)) < 0 Wl e C (26) 

cGC^ 

From the optimization formulation of the H policy, we also 
have 

X;ig£(E[ I Q[t - T^rnax],C[t - T^max] =c]) 

^ {‘^QA^ '^l.max]') 

>JAleC^Ac){‘^QAt-Tl.,niax]) (27) 


This is because, T* is an optimal threshold, and for a given 
c, the term p/(c) in the RHS is the Ah component of the point 
f){c) G M^, and this point r/(c) arises from a specific weighted 
combination of Ts (the weights being the fraction of times 
these Ts are chosen), where some of these Ts may not be 
optimal. The expectation term in the LHS results from giving 
a weight of 1 to T*. Also, given T*, the term Q[f — r max] 
in the conditional expectation on the LHS is redundant. 

Let K be an upper bound on the LHS of Inequality (27 1 
such that this upper bound is at least twice as large as the 
value of the expression on the LHS. Then, the quantity AT 
minus the value of the expression on the RHS serves as an 
upper bound on the value of the expression on the LHS. That 
is, 

X;ig£(E[ I Q[f - r,max],C[f - T,max] =c]) 

^ {‘^QA^ 2"/max]) 

< ^ - Z]jg£ (c) (2Qi [f - Ti^rnax] ) (28) 


From Inequality ^26f , we have 

i-mic)) < -(1 + e)A/ 

cGC^ 


(29) 


Taking expectation on both sides of Inequality (28 1 over C[f — 
T.,max], and noting that expectation is a linear operator, we get 

EcGC- {(E[ I Q[f-r max], 

C[t - T^max] = C ]) X {2Qi[t - T/^max])} 7r(c) 


<K + T,i^ci‘^QAt - Ti,max]) 7r(c) (-ryz(c)) 

<k+j:i^cW- ]) (-(1 + e)Ai) (using ( p9l l) 

< K - 2eY. 1(1 c(QAt - n,max] h) 


That is, 

EcGC- {(E[ Ezg£ (T*) 1 Q[i - T.^ax], 

C[t - T^max] = C ]) X {2Qi[t - T;,max])} 7r(c) 
< K-2eY,ieciQAt-Ti,max]>^i) (30) 


From Inequalities ([24| and (301, we have 


E [ l/[f + 1] - V[t] I Q[f - T^max] ] 

— 7T 26 E;g22 QAt 2"/max] A/ 


(31) 


Since AT is a constant, the RHS in the above inequality 
is greater or equal to zero only for a finite number of 
states in the system state Markov chain Y“^[f], and lesser 
or equal to zero for the rest of the states in Y'^[f]. It now 
follows from Foster’s theorem (see 1T8)), that the system 
state Markov chain Y'^[f] is positive recurrent. Hence the H 
policy preserves the stochastic stability of the network for the 
given arrival rate vector A. Since A is any arbitrary vector 
in A such that (1 + e)A G A, it follows that the H policy 
preserves the stochastic stability of the network for all arrival 
rate vectors in the interior of A. 


Appendix K 

Proof of Proposition[T2] 

We first count the number of comparison operations 
in steps 0 0 0 [T 6 ] and when the number of links 
in ActiveSet is n. We note that ActiveSet has L links in 
the first round, and that its size reduces by 1 in each 
subsequent round. In step there are n — 1 comparisons 
for each of the n links in ActiveSet. Therefore, the total 
number of comparisons in step (for all rounds together) is 
A(L-1) + (A-1)(A-2) + ... + 32 =0{L^). Step0can 
be accomplished using n — 1 comparisons yielding a total of 
(A — 1) + (A — 2) + ... + 2 = 0{LA) comparisons. In step 
|7] after suppressing the row and column of any one of the 
n — 1 links in ActiveSet \ {H}, we need n — 2 comparisons 
for each of the remaining n — 1 links in ActiveSet \ {AT} 
to compute the values {Ti^^ax}ieActiveSet\{K}, where AT 
is the link whose row and column we suppress. We further 
need another n — 1 comparisons in the worst case to find 
whether eliminating this link reduces the delay for at least 
one channel, by comparing the n — 1 newly computed 
delays with those computed earlier in step This gives 
us (A - 1)((A - 1)(A - 2) + (A - 1)) + (A - 2)((A - 
2)(A - 3) + (A - 2)) + ... + 3.(3.2 + 3) + 2.(2.1 + 2) 
= 0{L'^) comparisons. In the LC-ELDR policy, the set 
EC has n — I elements in the worst case, and hence step 16 
can be accomplished with n — 2 comparisons for a total of 
(A — 2) + (A — 3) + ... + 1 = 0{L‘^) comparisons. In step 16 
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of the LC-ERDMC policy, the set EC has n — \ elements 
in the worst case, and hence, as in step after suppressing 
the row and column for each link in EC, we need n — 2 
comparisons for each of the remaining n — 1 links in EC 
to compute the values {Ti^rnax}ieActiveSet\{K}- We further 
need another n — 1 comparisons to find the number of links 
for which the newly computed delays are lesser than the 
corresponding delays computed earlier in step This gives 
us (L - 1 )((L - 1 )(L - 2 ) + (L - 1 )) + (L - 2 )((L - 2 )(L- 
3) + (L - 2)) + ... + 3.(3.2 + 3) + 2.(2.1 + 2) = ©(L^) 


comparisons. Step 24 contributes a constant to the runtime 
since there would only be two links left in ActiveSet. 

Next, we count the number of multiplications and additions 
required in steps and 24 in computing the queue-length 
weighted conditional expected values. Evaluation of the 
conditional expectation for each of the n links in ActiveSet 
requires C multiplications and C additions, and one additional 
multiplication is required for multiplying with queue-length. 
Hence, in all, L{2C + 1) + {L- 1)(2C -f 1) -f ... -f 2(2C -f 1) 
= 0 {CL‘^) multiplications and additions are required. 

Therefore, the cost of comparisons, multiplications and 
additions together is 0 {CL^ + L^). 


Appendix L 

Proof of Proposition[T01 


First, we need to set up some notations. Let N be the 
number of links in ActiveSet and T := {1, 2,..., N} be the 
set of links in ActiveSet before the call to Algorithm (recall 
that in the case of a network with complete interference, and 
for the first call to Algorithm in the case of a network with 
multiple interference sets, ActiveSet will have all the links in 
the network). Let 5ij for 1 < < (V be the delay value in 

row i and column j of the delay table (note that Su = 0 Vi 
in our model). Let Ck,T be the realization of channel-state on 
link Ik at time t — t. Further, let 


l{a} 


f 1 if a is true 
\ 0 otherwise 

riiKi 

iGl 


E{k,Tk) 


:= max{l{ai}}. 


i&I 


:=E 




We will call the “while” loop body from line 3 to line 
23, and also the computation in lines 24—29 in Algorithmic in 
Sec. VI a “round”. Thus, if the LC-ELDR policy terminates 
at line 26 or at line 28, then it would have executed N — 1 
rounds (specifically, N — 2 rounds in the body of the “while” 
loop, and the last round [round — 1] in lines 24-29). 

Let us call the link that has not yet been eliminated from 
the set of contending links, an active link. Let be the 

active link with the largest expected data rate in round r. That 
is. 


:= 


arg max 

k eT\{ U e 

l<n<r-l 


(") 


|E(fc,T, 




where and are as defined below. 

Let be the link other than that has the smallest 

expected data rate among all links that reduce the common 
delay value with which the channel state of at least one link 
can be accessed by the transmitter nodes of all the contending 
links, if this link is eliminated in round r (i.e., among all 
links that reduce the maximum of the delay values in a row 
for at least one row [corresponding to an active link] in the 
table of delay values [after suppressing the rows and columns 
corresponding to links that have been eliminated up to round 
r], if this link is eliminated in round r). Equivalently, 
is the link that was eliminated in round r (if r is less than 
the current round number), or the link that will be eliminated 
in round r (if r is equal to the current round number). We 
can write as in Expr. (32i. The computation of is 
illustrated in Figs.j^b) - (e) for r = 1, and in Figs.j^f) - (h) 
for r = 2, for the setting in Sec. |VI-A| Note that the condition 
^ 0 ” indicates that some link (other than the one with 
the largest expected data rate in round r) is eliminated in round 
r, and hence that the LC-ELDR policy can proceed to round 
r + 1 (i.e., round r is not the last round). 

We let (not to be confused with Ti{h) defined in Sec. 


II-D|i be equal to Ti.^raax (see Sec. II-D[ ) at the beginning of 
round r, where Ti.^max is calculated after masking the rows 
and columns pertaining to the links that have been eliminated 
in rounds 1 to r — 1 (as illustrated in Figs, [^a), [^e), and 
gh)). That is. 


(r) 


max 

k G T\{j, U e 


(n) 


{^jk} 

} 


Note that > • • • > 

With these definitions in place, we are now ready to derive 
the required analytical expression for the expected saturated 
system throughput of the LC-ELDR policy. Let i G T be 
the link chosen by the LC-ELDR policy in a particular time 
slot, say t. We now consider the working of the LC-ELDR 
policy when it is executed at the transmitter node of link i 
in time slot t. From the listing of the LC-ELDR policy in 


Sec. VI we see that link i can emerge as the “winner” (i.e., 
as the link chosen by the LC-ELDR policy for carrying 
transmission) in time slot t in two ways - (i) from lines 
25-26 (i.e., round — 1 is the last round), or (ii) from lines 
8-10, 14 (i.e., an intermediate round r < N — 1 is the last 
round). Thus, link i can emerge as the “winner” when the 
last round is any of 1 to A^ — 1. Now, link i will emerge as 
the “winner” when the last round is r, 1 < r < A^ — 1 , if 
it “survives” (i.e., if it not eliminated in) round 1 , survives 
round 2 , ..., survives round r. 

Let us hrst hx a particular last round r, l<r<W— 1 
(i.e, the LC-ELDR policy runs from round 1 to round r and 
terminates immediately after round r). When the last round 
is r, if link i emerges as the “winner”, its contribution to the 
total expected saturated system throughput is given by 
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jeT, qG{l,2,....r} 
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argmin^ |l+{(5jfc 

k€T\{ U 

l<n<r-l 


> max U e(")} 

mGT\{j,k, U i<n<r-i 


l<n<9—1 


X E(fc,T^’'^)|, if r < iV — 1, and !+{.} above 


evaluates to 1 for at least 


fceT\{ U 

l<n<r-l 


(32) 


0 otherwise 

^for the LC-ELDR policy; ^chooses the link with the largest k if more than one link have the same minimum value 


r—1 


^Ijlink i is not eliminated in round f} 


X Ijround f is not the last round} 
Ijlink i is not eliminated in round r) 
X Ijround r is the last round) 

{ClAt - T^'^] = C M } pGT\{*} 

ge{1.2.....r}J 

X Pr {Ci^ [t - , C/, [t - rf 


X 2 ) , 


(33) 


Now, consider a particular round f of the LC-ELDR policy. 
If link i has survived (i.e., has not been eliminated in) the 
first f — 1 rounds, then it will survive round f if one of the 
following happens: 

1 ) link i has the largest expected data rate in round f (hence 
i = H and therefore i ^ EC', recall that we set aside the 
link with the largest expected data rate in each round so 
that it will not be eliminated in that round [see steps 6 and 
7 of Algorithm[^). We denote this condition as s{i,f, 1). 

2 ) link i does not have the largest expected data rate in round 
r (hence i ^ H), and link i does not reduce the maximum 
of the delay values in a row, for any row (corresponding 
to a link in ActiveSet) in the table of delay values (after 
suppressing the rows and columns corresponding to links 
that have been eliminated up to round f) if it is eliminated 
in round f (hence i ^ EC', see step 7 of Algorithm[^, and 
some link (other than the one with the largest expected 
data rate in round f) reduces the maximum of the delay 
values in a row, for at least one row (i.e. link) if it is 
eliminated in round f (i.e., EC ^ (p). We denote this 
condition as s{i,f,2). 

3) link i does not have the largest expected data rate in round 
f (hence i ^ H), and link i reduces the maximum of the 
delay values in a row, for at least one row (corresponding 
to a link in ActiveSet) in the table of delay values (after 
suppressing the rows and columns corresponding to links 
that have been eliminated up to round f) if it is eliminated 
in round f (hence i £ EC), and link i does not have the 


smallest expected data rate among the links that reduce 
the maximum of the delay values in a row, for at least 
one row (i.e. link) if that link is eliminated in round f 
(hence i ^ S', see step 16 of Algorithm [T]i. We denote 
this condition as s{i,f,3). 


Now, s{i,r, 1) can be written mathematically as: 
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max {E(fc,r^’'^)}| 

I 1 Jn)^ J 


fcGT\{ U 

l<n<r—l 


k<i 


> 


max {E(fc,rf^)}| 

I 1 ^(n), J 


feGT\{ U 

l<n<r—1 
k>i 


(34) 


In the expression for s(i,r, 1) above, we note that splitting 
the comparison of the expected data rates into two - namely, 
(i) comparison with (the maximum of the expected data rates 
of) transmitters that are less than i, and (ii) comparison with 
transmitters that are greater than i, creates a lexicographic or¬ 
dering among the transmitters. This is required to consistently 
resolve the “winner” in case there is a tie in the expected data 
rates of multiple transmitters - we always resolve in favor of 
the smallest numbered transmitter (as a convention) in case of 
a tie. 

Next, s{i,r,2) can be written mathematically as: 
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max {E(fc,r^''^)}, 

fcGT\{ U 

l<n<7—1 
k<.i 


E(i,Tf^)< max {E(A:,t^’'^)}| 

, r I 1 J 


fcGT\{ U e^”^} 




l<n<r—1 
k>i 


X 1 i Sji < 


max 

meT\{i,j, U A-C} 
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U 

l<n<r—1 


( 35 ) 


X 9^0} 


Finally, s{i,r,3) 
l+|E(i,Tf^) 


can be written mathematically as: 

< max {E(A:,t^’'^)}, 

fcGT\{ U 

l<n<r—1 
k<i 
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E(i,rf^)< max {E(fc, 

, r I ( „(n)^ J 


feGT\{ U 


X > 


l<n<r—l 
k>i 


max 


iGT\{i,j, U e^”^} 

l<n<r—1 


{Sjm} I 


ien{». 

U e(")} 


l<n<r—l 


X ^z} 


(36) 


From Q, ([g, @ and the expression for the con¬ 

tribution of link i to the total expected saturated system 
throughput when link i emerges as the “winner” when round 
1 < r < TV — 1, is the last round becomes 


E E 


(i) 

j&T 


r—i 


f=i 


ne{l,2,3} 


X 7 ^ 0}^l{s(z,r, = O} 

|c7;, [f - =c (,)| 


pGT\{z} 

ijG{l,2,...,r}. 


N 


m—1 \ n—1 / 

where 7rm(c) is the steady-state probability of being in state c 

(fe) 

in the channel-state Markov chain of link m, and p)^ is the 
probability of reaching state j from state z in fc steps in the 
channel-state Markov chain of link wP^ 

Now, summing over all possible last rounds r = 1 to — 
1, and over all links z = 1 to TV, we get the expression for 
the expected saturated system throughput of the LC-ELDR 
policy to be 
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as required. 


Appendix M 

Proof of Proposition[TT] 

The expression for the expected saturated system throughput 
of the LC-ERDMC policy is the same as the expression in 
Proposition 10 except that we need to redefine e(’’\ For that, 
in addition to carrying over all the notations we established in 
Appendix we need the following notations: 

r 1 if a is true 
\oo otherwise 

iGl 


s+{k,r) := s+{(5jfc > 

max {'Jjm}} jGT\{fc, 

m€T\{j,k U e(”)} U A") 

l<n<r—1 l<n<r—1 


} 


For the LC-ERDMC policy, we redefine to be the link 
other than that has the smallest expected data rate among 
all links that reduce the common delay value with which the 
channel state of a link can be accessed by the transmitters 
corresponding to all the contending links, for the largest 
number of links, if this link is eliminated in round r (i.e., 
among all links that reduce the maximum of the delay values 
in a row for the maximum number of rows [corresponding to 
active links] in the table of delay values [after suppressing 
the rows and columns corresponding to links that have been 
eliminated up to round r], if this link is eliminated in round 
r). We can write redefined as in Expr. (37 1 . The required 
expression for the expected saturated system throughput of the 
LC-ERDMC policy is then obtained by following the rest of 
the arguments as in the proof of Propositionin Appendix]^ 


n—1 


^TTm(c) = 7r(c) and p 


Vm S £ in our model; see Sec, 


II-A 
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eWt := < 


argmin^ < loo{s+(fc,r) = max s+(fc',r)} 

kGT\{ U ^ A:'GT\{_y 


l<n<r—1 


l<n<9—1 


X E(A:,t^’'^)|, if r < N — I, and loo{-} above evaluates to 1 for at least 


onefcGr\{ U 

l<n<r-l 


0 otherwise 

^for the LC-ERDMC policy; ^chooses the link with the largest k if more than one link have the same minimum value 


(37) 



